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LEARNING OUTCOME STATEMENTS (LOS) 
1. Rates and Returns 
The candidate should be able to: 
a. interpret interest rates as required rates of return, discount rates, or 
Opportunity costs and explain an interest rate as the sum of a real risk-free rate 
and premiums that compensate investors for bearing distinct types of risk. 


b. calculate and interpret different approaches to return measurement over time 
and describe their appropriate uses. 


c. compare the money-weighted and time-weighted rates of return and evaluate 
the performance of portfolios based on these measures. 


d. calculate and interpret annualized return measures and continuously 
compounded returns, and describe their appropriate uses. 


e. calculate and interpret major return measures and describe their appropriate 
uses. 


2. The Time Value of Money in Finance 
The candidate should be able to: 


a. calculate and interpret the present value (PV) of fixed-income and equity 
instruments based on expected future cash flows. 


b. calculate and interpret the implied return of fixed-income instruments and 
required return and implied growth of equity instruments given the present 
value (PV) and cash flows. 


c. explain the cash flow additivity principle, its importance for the no-arbitrage 
condition, and its use in calculating implied forward interest rates, forward 
exchange rates, and option values. 

3. Statistical Measures of Asset Returns 

The candidate should be able to: 

a. calculate, interpret, and evaluate measures of central tendency and location to 
address an investment problem. 


b. calculate, interpret, and evaluate measures of dispersion to address an 
investment problem. 


c. interpret and evaluate measures of skewness and kurtosis to address an 
investment problem. 


d. interpret correlation between two variables to address an investment problem. 
4. Probability Trees and Conditional Expectations 
The candidate should be able to: 
a. calculate expected values, variances, and standard deviations and demonstrate 
their application to investment problems. 


b. formulate an investment problem as a probability tree and explain the use of 
conditional expectations in investment application. 


c. calculate and interpret an updated probability in an investment setting using 
Bayes’ formula. 
5. Portfolio Mathematics 
The candidate should be able to: 
a. calculate and interpret the expected value, variance, standard deviation, 
covariances, and correlations of portfolio returns. 


b. calculate and interpret the covariance and correlation of portfolio returns using 
a joint probability function for returns. 


c. define shortfall risk, calculate the safety-first ratio, and identify an optimal 
portfolio using Roy’s safety-first criterion. 
6. Simulation Methods 
The candidate should be able to: 
a. explain the relationship between normal and lognormal distributions and why 
the lognormal distribution is used to model asset prices when using 
continuously compounded asset returns. 


b. describe Monte Carlo simulation and explain how it can be used in investment 
applications. 


c. describe the use of bootstrap resampling in conducting a simulation based on 
observed data in investment applications. 
7. Estimation and Inference 
The candidate should be able to: 
a. compare and contrast simple random, stratified random, cluster, convenience, 
and judgmental sampling and their implications for sampling error in an 
investment problem. 


b. explain the central limit theorem and its importance for the distribution and 
standard error of the sample mean. 


c. describe the use of resampling (bootstrap, jackknife) to estimate the sampling 
distribution of a statistic. 
8. Hypothesis Testing 
The candidate should be able to: 
a. explain hypothesis testing and its components, including statistical significance, 
Type I and Type II errors, and the power of a test. 


b. construct hypothesis tests and determine their statistical significance, the 
associated Type I and Type II errors, and power of the test given a significance 
level. 


c. compare and contrast parametric and nonparametric tests, and describe 
situations where each is the more appropriate type of test. 


9. Parametric and Non-Parametric Tests of Independence 
The candidate should be able to: 


a. explain parametric and nonparametric tests of the hypothesis that the 
population correlation coefficient equals zero, and determine whether the 
hypothesis is rejected at a given level of significance. 

b. explain tests of independence based on contingency table data. 

10. Simple Linear Regression 
The candidate should be able to: 
a. describe a simple linear regression model, how the least squares criterion is 


used to estimate regression coefficients, and the interpretation of these 
coefficients. 


b. explain the assumptions underlying the simple linear regression model, and 
describe how residuals and residual plots indicate if these assumptions may 
have been violated. 


c. calculate and interpret measures of fit and formulate and evaluate tests of fit 
and of regression coefficients in a simple linear regression. 


d. describe the use of analysis of variance (ANOVA) in regression analysis, 
interpret ANOVA results, and calculate and interpret the standard error of 
estimate in a simple linear regression. 


e. calculate and interpret the predicted value for the dependent variable, and a 
prediction interval for it, given an estimated linear regression model and a value 
for the independent variable. 


f. describe different functional forms of simple linear regressions. 
11. Introduction to Big Data Techniques 
The candidate should be able to: 


a. describe aspects of “fintech” that are directly relevant for the gathering and 
analyzing of financial data. 


b. describe Big Data, artificial intelligence, and machine learning. 

c. describe applications of Big Data and Data Science to investment management. 
12. Firms and Market Structures 

The candidate should be able to: 

a. determine and interpret breakeven and shutdown points of production, as well 


as how economies and diseconomies of scale affect costs under perfect and 
imperfect competition. 


b. describe characteristics of perfect competition, monopolistic competition, 
oligopoly, and pure monopoly. 

c. explain supply and demand relationships under monopolistic competition, 
including the optimal price and output for firms as well as pricing strategy. 

d. explain supply and demand relationships under oligopoly, including the optimal 
price and output for firms as well as pricing strategy. 


e. identify the type of market structure within which a firm operates and describe 
the use and limitations of concentration measures. 


13. Understanding Business Cycles 
The candidate should be able to: 


a. describe the business cycle and its phases. 
b. describe credit cycles. 
c. describe how resource use, consumer and business activity, housing sector 


activity, and external trade sector activity vary over the business cycle and 
describe their measurement using economic indicators. 


14. Fiscal Policy 
The candidate should be able to: 


a. compare monetary and fiscal policy. 


b. describe roles and objectives of fiscal policy as well as arguments as to whether 
the size of a national debt relative to GDP matters. 


c. describe tools of fiscal policy, including their advantages and disadvantages. 


d. explain the implementation of fiscal policy and difficulties of implementation as 
well as whether a fiscal policy is expansionary or contractionary. 


15. Monetary Policy 
The candidate should be able to: 


a. describe the roles and objectives of central banks. 


b. describe tools used to implement monetary policy tools and the monetary 
transmission mechanism, and explain the relationships between monetary 
policy and economic growth, inflation, interest, and exchange rates. 


c. describe qualities of effective central banks; contrast their use of inflation, 
interest rate, and exchange rate targeting in expansionary or contractionary 
monetary policy; and describe the limitations of monetary policy. 


d. explain the interaction of monetary and fiscal policy. 
16. Introduction to Geopolitics 


The candidate should be able to: 
a. describe geopolitics from a cooperation versus competition perspective. 


b. describe geopolitics and its relationship with globalization. 


c. describe functions and objectives of the international organizations that 
facilitate trade, including the World Bank, the International Monetary Fund, and 
the World Trade Organization. 


d. describe geopolitical risk. 
e. describe tools of geopolitics and their impact on regions and economies. 


f. describe the impact of geopolitical risk on investments. 
17. International Trade 
The candidate should be able to: 
a. describe the benefits and costs of international trade. 


b. compare types of trade restrictions, such as tariffs, quotas, and export subsidies, 
and their economic implications. 


c. explain motivations for and advantages of trading blocs, common markets, and 
economic unions. 
18. Capital Flows and the FX Market 
The candidate should be able to: 
a. describe the foreign exchange market, including its functions and participants, 
distinguish between nominal and real exchange rates, and calculate and 
interpret the percentage change in a currency relative to another currency. 


b. describe exchange rate regimes and explain the effects of exchange rates on 
countries’ international trade and capital flows. 


c. describe common objectives of capital restrictions imposed by governments. 


19. Exchange Rate Calculations 
The candidate should be able to: 


a. calculate and interpret currency cross-rates. 


b. explain the arbitrage relationship between spot and forward exchange rates and 
interest rates, calculate a forward rate using points or in percentage terms, and 
interpret a forward discount or premium. 


READING 1 
RATES AND RETURNS 


MODULE 1.1: INTEREST RATES AND Gg 
RETURN MEASUREMENT 


Video covering 
this content is 


LOS 1.a: Interpret interest rates as required rates of return, available online. 
discount rates, or opportunity costs and explain an interest rate 

as the sum of a real risk-free rate and premiums that compensate investors 
for bearing distinct types of risk. 


Interest rates measure the time value of money, although risk differences in financial 
securities lead to differences in their equilibrium interest rates. Equilibrium interest 
rates are the required rate of return for a particular investment, in the sense that 
the market rate of return is the return that investors and savers require to get them 
to willingly lend their funds. Interest rates are also referred to as discount rates 
and, in fact, the terms are often used interchangeably. If an individual can borrow 
funds at an interest rate of 10%, then that individual should discount payments to be 
made in the future at that rate to get their equivalent value in current dollars or 
other currencies. Finally, we can also view interest rates as the opportunity cost of 
current consumption. If the market rate of interest on 1-year securities is 5%, 
earning an additional 5% is the opportunity forgone when current consumption is 
chosen rather than saving (postponing consumption). 


The real risk-free rate of interest is a theoretical rate on a single-period loan that 
contains no expectation of inflation and zero probability of default. What the real 
risk-free rate represents in economic terms is time preference, the degree to which 
current consumption is preferred to equal future consumption. 


When we speak of a real rate of return, we are referring to an investor’s increase in 
purchasing power (after adjusting for inflation). Because expected inflation in future 
periods is not zero, the rates we observe on USS. Treasury bills (T-bills), for example, 
are essentially risk-free rates, but not real rates of return. T-bill rates are nominal 
risk-free rates because they contain an inflation premium. This is the relation: 


(1 + nominal risk-free rate) = (1 + real risk-free rate)(1 + expected inflation rate) 


Often, including in many parts of the CFA curriculum, this relation is approximated 
as follows: 


nominal risk-free rate ~ real risk-free rate + expected inflation rate 


Securities may have one or more types of risk, and each added risk increases the 
required rate of return. These types of risks are as follows: 


« Default risk. This is the risk that a borrower will not make the promised 
payments in a timely manner. 


» Liquidity risk. This is the risk of receiving less than fair value for an investment if 
it must be sold quickly for cash. 


» Maturity risk. As we will see in the Fixed Income topic area, the prices of longer- 
term bonds are more volatile than those of shorter-term bonds. Longer-maturity 
bonds have more maturity risk than shorter-term bonds and require a maturity 
risk premium. 


Each of these risk factors is associated with a risk premium that we add to the 
nominal risk-free rate to adjust for greater default risk, less liquidity, and longer 
maturity relative to a liquid, short-term, default risk-free rate such as that on T-bills. 
We can write the following: 


nominal rate of interest = real risk-free rate 
+ inflation premium 
+ default risk premium 
+ liquidity premium 


+ maturity premium 


LOS 1.b: Calculate and interpret different approaches to return measurement 
over time and describe their appropriate uses. 


Holding period return (HPR) is simply the percentage increase in the value of an 
investment over a given period: 


end-of-period value 


holding period return ap eee 


For example, a stock that pays a dividend during a holding period has an HPR for 
that period equal to: 
P, + Div, P, — Py + Div, 
i a "ae 
If a stock is valued at €20 at the beginning of the period, pays €1 in dividends over 
the period, and at the end of the period is valued at €22, the HPR is: 
HPR = (22 +1) /20-1=0.15 = 15% 


Returns over multiple periods reflect compounding. For example, given HPRs for 
Years 1, 2, and 3, the HPR for the entire three-year period is: 


HPR = (1 + HPRy,., ;)(1 + HPRy.. 2)(1 + HPRy,..3)— 1 


Later in this reading, we will see that a return over multiple years is typically stated 
as an annualized return rather than an HPR. 


Average Returns 


The arithmetic mean return is the simple average of a series of periodic returns. It 
has the statistical property of being an unbiased estimator of the true mean of the 


underlying distribution of returns: 


: (R, + R,+R,+..+R,) 
arithmetic mean return a 


The geometric mean return is a compound rate. When periodic rates of return 
vary from period to period, the geometric mean return will have a value less than 
the arithmetic mean return: 

geometric mean return (1+ R,) x (1+ R,) x (1+R,) x... x (1+R,)-1 


For example, for returns R, over three annual periods, the geometric mean return is 
calculated as the following example shows. 


EXAMPLE: Geometric mean return 


For the last three years, the returns for Acme Corporation common stock have 
been —9.34%, 23.45%, and 8.92%. Calculate the compound annual rate of return 
over the three-year period. 

Answer: 


YT — 0.0934) > 


x (1 + 0.2345) x (1 + 0.0892) — 1 


=~/1.21903 — 1 
Rg = 1.06825 — 1 = 6.825% 
Solve this type of problem with your calculator as follows: 
» On the TI, enter 1.21903 [y*] 3 [1/x] [=] 
» On the HP, enter 1.21903 [ENTER] 3 [1/x] [y*] 


PROFESSOR’S NOTE 
The geometric mean is always less than or equal to the arithmetic mean, 
and the difference increases as the dispersion of the observations 
increases. The only time the arithmetic and geometric means are equal is 
when there is no variability in the observations (i.e., all observations are 
equal). 

A harmonic mean is used for certain computations, such as the averaee cost of 


shares purchased over time. The harmonic mean is calculated as > 1 where there 
Xx. 
a a 


are N values of X;. 


EXAMPLE: Calculating average cost with the harmonic mean 


An investor purchases $1,000 of mutual fund shares each month, and over the last 
three months, the prices paid per share were $8, $9, and $10. What is the average 
cost per share? 


Answer: 
Xu= ——3 ___ — $8.926 per share 
Le eS 
a9 = 10 


To check this result, calculate the total shares purchased as follows: 


1,000 1,000 1,000 
a 7s ao Bam 336.11 shares 


$3,000 
336.11 


The average price is = $8.926 per share. 


The previous example illustrates the interpretation of the harmonic mean in its 
most common application. Note that the average price paid per share ($8.93) is less 


than the arithmetic average of the share prices, which is oe 9. 


The relationship among arithmetic, geometric, and harmonic means can be stated as 
follows: 


arithmetic mean x harmonic mean = (geometric mean)* 


PROFESSOR’S NOTE 
The proof of this is beyond the scope of the Level I exam. 


For values that are not all equal, harmonic mean < geometric mean < arithmetic 
mean. This mathematical fact is the basis for the claimed benefit of purchasing the 
same money amount of mutual fund shares each month or each week. Some refer to 
this practice as cost averaging. 


Measures of average return can be affected by outliers, which are unusual 
observations in a dataset. Two of the methods for dealing with outliers are a 
trimmed mean and a winsorized mean. We will examine these in our reading on 
Statistical Measures of Asset Returns. 


Appropriate uses for the various return measures are as follows: 


« Arithmetic mean. Include all values, including outliers. 
=» Geometric mean. Compound the rate of returns over multiple periods. 


» Harmonic mean. Calculate the average share cost from periodic purchases ina 
fixed money amount. 


« Trimmed or winsorized mean. Decrease the effect of outliers. 


MODULE QUIZ 1.1 


=) 
1. An interest rate is best interpreted as a: 
A. discount rate or a measure of risk. 
B. measure of risk or a required rate of return. 


C. required rate of return or the opportunity cost of consumption. 


2. An interest rate from which the inflation premium has been subtracted is known 
as a: 
A. real interest rate. 
B. risk-free interest rate. 
C. real risk-free interest rate. 


3. The harmonic mean of 38, 4, and 5 is: 
A. 3.74. 
B. 3.83. 
C. 4.12. 


4. XYZ Corp. Annual Stock Returns 


Year 20X1 20X2 20X3 20X4 20X5 20X6 
Return 22% 5% 7% 11% 2% £11% 


The mean annual return on XYZ stock is most appropriately calculated using the: 
A. harmonic mean. 

B. arithmetic mean. 

C. geometric mean. 


MODULE 1.2: TIME-WEIGHTED AND C) 
MONEY-WEIGHTED RETURNS 


Video covering 
this content is 
available online. 
LOS 1.c: Compare the money-weighted and time-weighted rates 


of return and evaluate the performance of portfolios based on 
these measures. 


The money-weighted return applies the concept of the internal rate of return 
(IRR) to investment portfolios. An IRR is the interest rate at which a series of cash 
inflows and outflows sum to zero when discounted to their present value. That is, 
they have a net present value (NPV) of zero. The IRR and NPV are built-in 
functions on financial calculators that CFA Institute permits candidates to use for 
the exam. 


PROFESSOR’S NOTE 

We have provided an online video in the Resource Library on how to use 
the TI calculator. You can view it by logging in to your account at 
www.schweser.com. 


The money-weighted rate of return is defined as the IRR on a portfolio, taking 
into account all cash inflows and outflows. The beginning value of the account is an 
inflow, as are all deposits into the account. All withdrawals from the account are 
outflows, as is the ending value. 


EXAMPLE: Money-weighted rate of return 


Assume an investor buys a share of stock for $100 at t = 0, and at the end of the 
year (t = 1), she buys an additional share for $120. At the end of Year 2, the 
investor sells both shares for $130 each. At the end of each year in the holding 
period, the stock paid a $2 per share dividend. What is the money-weighted rate 
of return? 


Step I: Determine the timing of each cash flow and whether the cash flow is 
an inflow (+), into the account, or an outflow (—), available from the 


account. 
t= 0: purchase of first share = +$100.00 inflow to account 
t= 1: purchase of second share = +$120.00 
dividends from firstshare = -—$2.00 
subtotal, t= 1 +$118.00 inflow to account 
t= 2: dividend fromtwo shares = -—$4.00 
proceeds from selling shares = —$260.00 
subtotal, t= 2 —$264.00 outflow from account 


Step 2: Net the cash flows for each period and set the PV of cash inflows 
equal to the PV of cash outflows. 


ae a SP ea 
$118 _ _ $264 
(l+r) (141)? 
Step 3: Solve for r to find the money-weighted rate of return. This can be 


done using trial and error or by using the IRR function on a financial 
calculator or spreadsheet. 


The intuition here is that we deposited $100 into the account at t = 0, then added 
$118 to the account at t = 1 (which, with the $2 dividend, funded the purchase of 
one more share at $120), and ended with a total value of $264. 


$100 + 


To compute this value with a financial calculator, use these net cash flows and 
follow the procedure(s) described to calculate the IRR: 
net cash flows: CF, = +100; CF, = +120 — 2 = +118; 
CF, = —260 + —4 = —264 


Calculating money-weighted return with the TI Business Analyst II Plus® 


Note the values for F01, F0O2, and so on, are all equal to 1. 


Keystrokes Explanation Display 

" [CF] [294][(CLR WORK] Clear cash flow registers CFO=0.00000 
100 [ENTER] Initial cash outlay CFO = +100.00000 
[|] 118 [ENTER] Period 1 cash flow C01 = +118.00000 
[1] [1] 264 [+/—] [ENTER] Period 2 cash flow C02 = —264.00000 
[IRR] [CPT] Calculate IRR IRR = 13.86122 


The money-weighted rate of return for this problem is 13.86%. 


PROFESSOR’S NOTE 

In the preceding example, we entered the flows into the account as a 
positive and the ending value as a negative (the investor could withdraw 
this amount from the account). Note that there is no difference in the 
solution if we enter the cash flows into the account as negative values 
(out of the investor's pocket) and the ending value as a positive value 
(into the investor’s pocket). As long as payments into the account and 
payments out of the account (including the ending value) are entered 
with opposite signs, the computed IRR will be correct. 


Time-weighted rate of return measures compound growth and is the rate at which 
$1 compounds over a specified performance horizon. Time-weighting is the process 
of averaging a set of values over time. The annual time-weighted return for an 
investment may be computed by performing the following steps: 

Step 1: Value the portfolio immediately preceding significant additions or withdrawals. Form 


subperiods over the evaluation period that correspond to the dates of deposits and 
withdrawals. 


Step 2: Compute the holding period return (HPR) of the portfolio for each subperiod. 


Step 3: Compute the product of (1 + HPR) for each subperiod to obtain a total return for the entire 
measurement period [i-e., (1 + HPRy) x (1 + HPR2) ... (1 + HPRy)] - 1. If the total investment 


period is greater than one year, you must take the geometric mean of the measurement 
period return to find the annual time-weighted rate of return. 


EXAMPLE: Time-weighted rate of return 


An investor purchases a share of stock at t = 0 for $100. At the end of the year, t = 
1, the investor buys another share of the same stock for $120. At the end of Year 2, 
the investor sells both shares for $130 each. At the end of both Years 1 and 2, the 
stock paid a $2 per share dividend. What is the annual time-weighted rate of 
return for this investment? (This is the same investment as the preceding 
example.) 


Answer: 


Step I: Break the evaluation period into two subperiods based on timing 
of cash flows. 


Holding period 1: Beginning value = $100 
Dividends paid — $2 
Ending value = $120 
Holding period 2: Beginning value = $240 (2 shares) 
Dividends paid = $4($2 per share) 
Ending value = $260 (2 shares) 
Step 2: Calculate the HPR for each holding period. 
HPR, = [($120 + 2)/$100] — 1 = 22% 
HPR, = [($260 + 4)/$240] — 1 = 10% 


Step 3: Find the compound annual rate that would have produced a total 
return equal to the return on the account over the two-year period. 


(1 + time-weighted rate of return)? = (1.22)(1.10) 
time-weighted rate of return = [(1.22)(1.10)]®> — 1 = 15.84% 


The time-weighted rate of return is not affected by the timing of cash inflows and 
outflows. In the investment management industry, time-weighted return is the 
preferred method of performance measurement because portfolio managers 
typically do not control the timing of deposits to and withdrawals from the accounts 
they manage. 


In the preceding examples, the time-weighted rate of return for the portfolio was 
15.84%, while the money-weighted rate of return for the same portfolio was 13.86%. 
The results are different because the money-weighted rate of return gave a larger 


weight to the Year 2 HPR, which was 10%, versus the 22% HPR for Year 1. This is 
because there was more money in the account at the beginning of the second period. 


If funds are contributed to an investment portfolio just before a period of relatively 
poor portfolio performance, the money-weighted rate of return will tend to be lower 
than the time-weighted rate of return. On the other hand, if funds are contributed to 
a portfolio at a favorable time (just before a period of relatively high returns), the 
money-weighted rate of return will be higher than the time-weighted rate of return. 
The use of the time-weighted return removes these distortions, and thus provides a 
better measure of a manager’s ability to select investments over the period. If the 
manager has complete control over money flows into and out of an account, the 
money-weighted rate of return would be the more appropriate performance 
measure. 


=I MODULE QUIZ 1.2 


1. An investor buys a share of stock for $40 at time t = 0, buys another share of the 
same stock for $50 at t= 1, and sells both shares for $60 each at t = 2. The stock 
paid a dividend of $1 per share at ¢ = 1 and at ¢ = 2. The periodic money-weighted 
rate of return on the investment is closest to: 

A. 22.2%. 
B. 23.0%. 
C. 23.8%. 


2. An investor buys a share of stock for $40 at time ¢ = 0, buys another share of the 
same stock for $50 at t= 1, and sells both shares for $60 each at t = 2. The stock 
paid a dividend of $1 per share at t= 1 and at t = 2. The time-weighted rate of 
return on the investment for the period is closest to: 

A. 24.7%. 
B. 25.7%. 
C. 26.8%. 


MODULE 1.3: COMMON MEASURES OF (>) 
RETURN 


Video covering 
this content is 


. . available online. 
LOS 1.d: Calculate and interpret annualized return measures 


and continuously compounded returns, and describe their 
appropriate uses. 


Interest rates and market returns are typically stated as annualized returns, 
regardless of the actual length of the time period over which they occur. To 
annualize an HPR that is realized over a specific number of days, use the following 
formula: 


annualized return = (1 + HPR)*°/** — 1 


EXAMPLE: Annualized return, shorter than one year 


A saver deposits $100 into a bank account. After 90 days, the account balance is 
$100.75. What is the saver’s annualized rate of return? 


Answer: 


100.75 _ " 
HPR = “997 — 1 = 0.0075 = 0.75% 


annualized return = (1 + 0.0075)3®/% — 1 = 0.0308 = 3.08% 


EXAMPLE: Annualized return, longer than one year 


An investor buys a 500-day government bill for $970 and redeems it at maturity 
for $1,000. What is the investor’s annualized return? 


Answer: 


1000 
= 0.0309 = 3.099 
HPR = O70 — 1= 0.0309 = 3.09% 


annualized return = (1 + 0.0309)3/909 — 1 = 0.0225 = 2.25% 


In time value of money calculations (which we will address in more detail in our 
reading on The Time Value of Money in Finance), more frequent compounding has an 
impact on future value and present value computations. Specifically, because an 
increase in the frequency of compounding increases the effective interest rate, it also 
increases the future value of a given cash flow and decreases the present value of a 
given cash flow. 


This is the general formula for the present value of a future cash flow: 
mN 
PV =FV,(1+ 4) 


where: 

r = quoted annual interest rate 

N = number of years 

m = compounding periods per year 


EXAMPLE: The effect of compounding frequency on FV and PV 


Compute the PV of $1,000 to be received one year from now using a stated annual 
interest rate of 6% with a range of compounding periods. 


Answer: 
With semiannual compounding, m = 2: 
PV = 1,000(1 + 298) ~* _ 942.60 
With quarterly compounding, m = 4: 
PV = 1,000(1 + 2:00) * 942.18 
With monthly compounding, m = 12: 
PV = 1,000(1 + 900) * _ 941.91 
With daily compounding, m = 365: 
PV = 1,000(1 + S22) © 941.77 
Compounding Frequency Effect 


Compounding Interest Rate Present 


Frequency per Period Value 

Semiannual (m= 2) 3.000 942.60 
Quarterly (m = 4) 1.500 942.18 
Monthly (m = 12) 0.500 941.91 
Daily (m = 365) 0.016438 941.77 


The mathematical limit of shortening the compounding period is known as 
continuous compounding. Given an HPR, we can use the natural logarithm (In, or LN 
on your financial calculator) to state its associated continuously compounded 
return: 


‘ ending value ° 
= In(1 + HPR) = inj ———————_ 
“x Uf ; ( beginning value ) 


Notice that because the calculation is based on 1 plus the HPR, we can also perform 
it directly from the price relative. The price relative is just the end-of-period value 
divided by the beginning-of-period value. 


EXAMPLE: Calculating continuously compounded returns 


A stock was purchased for $100 and sold one year later for $120. Calculate the 
investor’s annual rate of return on a continuously compounded basis. 


Answer: 


In( 120) = 18.232% 


If we had been given the return (20%) instead, the calculation is this: 


In (1 + 0.20) = 18.232% 


A useful property of continuously compounded rates of return is that they are 
additive for multiple periods. That is, a continuously compounded return from t = 0 
to t= 2 is the sum of the continuously compounded return from t = 0 to t= 1 and the 
continuously compounded return from t= 1 to t= 2. 


LOS 1.e: Calculate and interpret major return measures and describe their 
appropriate uses. 


Gross return refers to the total return on a security portfolio before deducting fees 
for the management and administration of the investment account. Net return 
refers to the return after these fees have been deducted. Commissions on trades and 
other costs that are necessary to generate the investment returns are deducted in 
both gross and net return measures. 


Pretax nominal return refers to the return before paying taxes. Dividend income, 
interest income, short-term capital gains, and long-term capital gains may all be 


taxed at different rates. After-tax nominal return refers to the return after the tax 
liability is deducted. 


Real return is nominal return adjusted for inflation. Consider an investor who earns 
a nominal return of 7% over a year when inflation is 2%. The investor’s approximate 
real return is simply 7 — 2 = 5%. The investor’s exact real return is slightly lower: 
1.07 / 1.02 - 1 = 0.049 = 4.9%. 


Using the components of an interest rate we defined earlier, we can state a real 
return as follows: 


(1 + nominal risk-free rate)(1 + risk premium) 


a aac (1 + inflation premium) 


PROFESSOR’S NOTE 

The Level I curriculum states this relationship as 

(jt? ae 
(1 + inflation premium) 


Stating it this way assumes the risk premium includes inflation risk. 


Real return measures the increase in an investor’s purchasing power—how much 
more goods she can purchase at the end of one year due to the increase in the value 
of her investments. If she invests $1,000 and earns a nominal return of 7%, she will 
have $1,070 at the end of the year. If the price of the goods she consumes has gone 
up 2%, from $1.00 to $1.02, she will be able to consume 1,070 / 1.02 = 1,049 units. 
She has given up consuming 1,000 units today, but instead, she is able to purchase 
1,049 units at the end of one year. Her purchasing power has gone up 4.9%; this is 
her real return. 


A leveraged return refers to a return to an investor that is a multiple of the return 
on the underlying asset. The leveraged return is calculated as the gain or loss on the 
investment as a percentage of an investor’s cash investment. An investment in a 
derivative security, such as a futures contract, produces a leveraged return because 
the cash deposited is only a fraction of the value of the assets underlying the futures 
contract. Leveraged investments in real estate are common: investors pay only a 
portion of a property’s cost in cash and borrow the rest. 


To illustrate the effect of leverage on returns, consider a fund that can invest the 
amount V, without leverage, and earn the rate of return r. The fund’s unleveraged 
return (as a money amount) is simply r x Vo. Now let’s say this fund can borrow the 
amount V, at an interest rate of rp, and earn r by investing the proceeds. The fund’s 
leveraged return (again as a money amount), after subtracting the interest cost, then 
becomes r x (Vo + Vg) - (Tp * Vp). 


Thus, stated as a rate of return on the initial value of Vo, the leveraged rate of return 
is as follows: 
1(Vo + Vz) — tBVp 


leveraged return v 


0 


MODULE QUIZ 1.3 
—=h 


1. If an investment loses 3% of its value over 120 days, its annualized return is 
closest to: 


A. -8.0%. 

B. -8.5%. 

C. -9.0%. 

2. If a stock’s initial price is $20 and its price increases to $28, its continuously 

compounded rate of return is closest to: 

A. 13.64%. 

B. 13.98%. 

C. 15.00%. 


3. The value of an investment increases 5% before commissions and fees. This 5% 
increase represents: 
A. the investment’s net return. 
B. the investment’s gross return. 
C. neither the investment’s gross return nor its net return. 


KEY CONCEPTS 


LOS 1.a 


An interest rate can be interpreted as the rate of return required in equilibrium for a 
particular investment, the discount rate for calculating the present value of future 
cash flows, or as the opportunity cost of consuming now, rather than saving and 
investing. 


The real risk-free rate reflects time preference for present goods versus future 
goods. Nominal risk-free rate ~ real risk-free rate + expected inflation rate. 


Securities may have several risks, and each increases the required rate of return. 
These include default risk, liquidity risk, and maturity risk. 


We can view a nominal interest rate as the sum of a real risk-free rate, expected 
inflation, a default risk premium, a liquidity premium, and a maturity premium. 


LOS 1.b 


Holding period return is used to measure an investment’s return over a specific 
period. Arithmetic mean return is the simple average of a series of periodic returns. 
Geometric mean return is a compound annual rate. 


Arithmetic mean return includes all observations, including outliers. Geometric 
mean return is used for compound returns over multiple periods. Harmonic mean is 
used to calculate the average price paid with equal periodic investments. Trimmed 
mean or winsorized mean are used to reduce the effect of outliers. 


LOS 1.c 


The money-weighted rate of return is the IRR calculated using periodic cash flows 
into and out of an account and is the discount rate that makes the PV of cash inflows 
equal to the PV of cash outflows. 


The time-weighted rate of return measures compound growth and is the rate at 
which money compounds over a specified performance horizon. 


If funds are added to a portfolio just before a period of poor performance, the 
money-weighted return will be lower than the time-weighted return. If funds are 


added just before a period of high returns, the money-weighted return will be higher 
than the time-weighted return. 


The time-weighted return is the preferred measure of a manager’s ability to select 
investments. If the manager controls the money flows into and out of an account, the 
money-weighted return is the more appropriate performance measure. 


LOS 1.d 


Interest rates and market returns are typically stated on an annualized basis: 


annualized return = (1 + HPR)*°/¥5 — 1 


Given a holding period return, this is the associated continuously compounded 
return: 


cc 


R.. = In(1 + HPR) = in( —— ) 


beginning value 
LOS 1.e 


Gross return is the total return after deducting commissions on trades and other 
costs necessary to generate the returns, but before deducting fees for the 
management and administration of the investment account. Net return is the return 
after management and administration fees have been deducted. 


Pretax nominal return is the numerical percentage return of an investment, without 
considering the effects of taxes and inflation. After-tax nominal return is the 
numerical return after the tax liability is deducted, without adjusting for inflation. 


Real return is the increase in an investor’s purchasing power, roughly equal to 
nominal return minus inflation. 


Leveraged return is the gain or loss on an investment as a percentage of an investor’s 
cash investment. 


ANSWER KEY FOR MODULE QUIZZES 


Module Quiz 1.1 


1.C Interest rates can be interpreted as required rates of return, discount rates, or 
Opportunity costs of current consumption. A risk premium can be, but is not 
always, a component of an interest rate. (LOS 1.a) 


2.A Real interest rates are those that have been adjusted for inflation. (LOS 1.a) 


3 3 
3.B *H>] ia 
ht 4a 
(LOS 1.b) 


4.C Because returns are compounded, the geometric mean is appropriate. 
((1.22)(1.05)(0.931.119(1.02(1.11)] ”° — 1 = 6.96% 


(LOS 1b) 


Module Quiz 1.2 


1.C Using the cash flow functions on your financial calculator, enter CFo = -40; CF, 
=-50 + 1=-49; CF, = 60 x 2 + 2 = 122; and CPT IRR = 23.82%. 


(LOS 1.c) 
2.B HPR, = 22+! 127.5% 
40 
60+2 _ ‘ 
HPR, = © * = _ 1 = 24.0% 
TWR = V1 + 0.2751 + 0.24) — 1 = 25.74% 
(LOS 1.c) 


Module Quiz 1.3 


1.B Annualized return = (1 — 0.03)969/129 _ 1 = — 0.0885 = —8.55% 
(LOS 1.d) 


2.B In (23/20) = 0.1398 
(LOS 1.4) 


3.C Gross return is the total return after deducting commissions on trades and 
other costs necessary to generate the returns, but before deducting fees for the 
management and administration of the investment account. Net return is the 
return after management and administration fees have been deducted. (LOS 1.e) 


READING 2 


THE TIME VALUE OF MONEY IN 
FINANCE 


PROFESSOR’S NOTE 

The examples we use in this reading are meant to show how the time 
value of money appears throughout finance. Don’t worry if you are not 
yet familiar with the securities we describe in this reading. We will see 
these examples again when we cover bonds and forward interest rates in 
Fixed Income, stocks in Equity Investments, foreign exchange in 
Economics, and options in Derivatives. 


WARM-UP: USING A FINANCIAL CALCULATOR 


For the exam, you must be able to use a financial calculator when working time 
value of money problems. You simply do not have the time to solve these problems 
any other way. 


CFA Institute allows only two types of calculators to be used for the exam: (1) the 
Texas Instruments® TI BA II Plus” (including the BA II Plus Professional”) and (2) 
the HP® 12C (including the HP 12C Platinum). This reading is written primarily 
with the TI BA II Plus in mind. If you do not already own a calculator, purchase a TI 
BA II Plus! However, if you already own the HP 12C and are comfortable with it, by 
all means, continue to use it. 


Before we begin working with financial calculators, you should familiarize yourself 
with your TI BA II Plus by locating the keys noted below. These are the only keys 
you need to know to calculate virtually all of the time value of money problems: 

=» N =number of compounding periods 

» I/Y =interest rate per compounding period 

=» PV =present value 

« FV = future value 

» PMT = annuity payments, or constant periodic cash flow 

=» CPT =compute 

The TI BA II Plus comes preloaded from the factory with the periods per year 
function (P/Y) set to 12. This automatically converts the annual interest rate (I/Y) 
into monthly rates. While appropriate for many loan-type problems, this feature is 
not suitable for the vast majority of the time value of money applications we will be 
studying. So, before using our SchweserNotes™, please set your P/Y key to “1” using 
the following sequence of keystrokes: 


[2nd] [P/Y] “1" [ENTER] [2nd] [QUIT] 


As long as you do not change the P/Y setting, it will remain set at one period per 
year until the battery from your calculator is removed (it does not change when you 
turn the calculator on and off). If you want to check this setting at any time, press 
[2nd] [P/Y]. The display should read P/Y = 1.0. If it does, press [2nd] [QUIT] to get 
out of the “programming” mode. If it does not, repeat the procedure previously 
described to set the P/Y key. With P/Y set to equal 1, it is now possible to think of 
I/Y as the interest rate per compounding period and N as the number of 
compounding periods under analysis. Thinking of these keys in this way should help 
you keep things straight as we work through time value of money problems. 


PROFESSOR’S NOTE 

We have provided an online video in the Resource Library on how to use 
the TI calculator. You can view it by logging in to your account at 
www.schweser.com. 


MODULE 2.1: DISCOUNTED CASH FLOW [>] 
VALUATION 


Video covering 
this content is 
available online. 


LOS 2.a: Calculate and interpret the present value (PV) of fixed- 
income and equity instruments based on expected future cash flows. 


In our Rates and Returns reading, we gave examples of the relationship between 
present values and future values. We can simplify that relationship as follows: 


FV = PV(1 +1) 


PV wo FV(1 +1) 


where: 
r = interest rate per compounding period 
t = number of compounding periods 
If we are using continuous compounding, this is the relationship: 
FV = PV x 


PV =FVxe™* 


Fixed-Income Securities 


One of the simplest examples of the time value of money concept is a pure discount 
debt instrument, such as a zero-coupon bond. With a pure discount instrument, the 
investor pays less than the face value to buy the instrument and receives the face 
value at maturity. The price the investor pays depends on the instrument’s yield to 
maturity (the discount rate applied to the face value) and the time until maturity. 
The amount of interest the investor earns is the difference between the face value 
and the purchase price. 


| EXAMPLE: Zero-coupon bond 


A zero-coupon bond with a face value of $1,000 will mature 15 years from today. 
The bond has a yield to maturity of 4%. Assuming annual compounding, what is 
the bond’s price? 


Answer: 

$1,000 
We can infer a bond’s yield from its price using the same relationship. Rather than 
solving for r with algebra, we typically use our financial calculators. For this 
example, if we were given the price of $555.26, the face value of $1,000, and annual 
compounding over 15 years, we would enter the following: 


PV = -555.26 
FV = 1,000 
PMT =0 
N=15 
Then, to get the yield, CPT I/Y = 4.00. 


PROFESSOR’S NOTE 

Remember to enter cash outflows as negative values and cash inflows as 
positive values. From the investor’s point of view, the purchase price (PV) 
is an outflow, and the return of the face value at maturity (FV) is an 
inflow. 


In some circumstances, interest rates can be negative. A zero-coupon bond with a 
negative yield would be priced at a premium, which means its price is greater than 
its face value. 


EXAMPLE: Zero-coupon bond with a negative yield 


If the bond in the previous example has a yield to maturity of —0.5%, what is its 
price, assuming annual compounding? 


Answer: 

$1,000 

— 1,078.09 
(1 - 0.005) : 

A fixed-coupon bond is only slightly more complex. With a coupon bond, the 
investor receives a cash interest payment each period in addition to the face value at 
maturity. The bond’s coupon rate is a percentage of the face value and determines 
the amount of the interest payments. For example, a 3% annual coupon, $1,000 bond 
pays 3% of $1,000, or $30, each year. 


The coupon rate and the yield to maturity are two different things. We only use the 
coupon rate to determine the coupon payment (PMT). The yield to maturity (I/Y) is 
the discount rate implied by the bond’s price. 


| EXAMPLE: Price of an annual coupon bond 


Consider a 10-year, $1,000 par value, 10% coupon, annual-pay bond. What is the 
value of this bond if its yield to maturity is 8%? 


Answer: 


The coupon payments will be 10% x $1,000 = $100 at the end of each year. The 
$1,000 par value will be paid at the end of Year 10, along with the last coupon 
payment. 


The value of this bond with a discount rate (yield to maturity) of 8% is: 


100 , 100 , 100 , =, 100 , 1,100 _ 5 13490 
1.08 © 1.087 


1.089  °** * 1.089 © 1.0810 
The calculator solution is: 

N = 10; PMT = 100; FV = 1,000; I/Y = 8; CPT PV = -1,134.20 
The bond’s value is $1,134.20. 


PROFESSOR’S NOTE 
For this reading where we want to illustrate time value of money 
concepts, we are only using annual coupon payments and compounding 
periods. In the Fixed Income topic area, we will also perform these 
calculations for semiannual-pay bonds. 
Some bonds exist that have no maturity date. We refer to these as perpetual bonds 
or perpetuities. We cannot speak meaningfully of the future value of a perpetuity, 
but its present value simplifies mathematically to the following: 


PV of a perpetuity isa inn 


An amortizing bond is one that pays a level amount each period, including its 
maturity period. The difference between an amortizing bond and a fixed-coupon 
bond is that for an amortizing bond, each payment includes some portion of the 
principal. With a fixed-coupon bond, the entire principal is paid to the investor on 
the maturity date. 


Amortizing bonds are an example of an annuity instrument. For an annuity, the 
payment each period is calculated as follows: 


: x PV 
annuity payment ata 
where: 

r = interest rate per period 

t = number of periods 

PV = present value (principal) 


We can also determine an annuity payment using a financial calculator. 


EXAMPLE: Computing a loan payment 


Suppose you are considering applying for a $2,000 loan that will be repaid with 
equal end-of-year payments over the next 13 years. If the annual interest rate for 
the loan is 6%, how much are your payments? 


Answer: 


The size of the end-of-year loan payment can be determined by inputting values 
for the three known variables and computing PMT. Note that FV = 0 because the 
loan will be fully paid off after the last payment: 


N = 13; 1/Y = 6; PV = -2,000; FV = 0; CPT — PMT = $225.92 


Equity Securities 
As with fixed-income securities, we value equity securities such as common and 


preferred stock as the present value of their future cash flows. The key differences 
are that equity securities do not mature, and their cash flows may change over time. 


Preferred stock pays a fixed dividend that is stated as a percentage of its par value 
(similar to the face value of a bond). As with bonds, we must distinguish between the 
stated percentage that determines the cash flows and the discount rate we apply to 
the cash flows. We say that equity investors have a required return that will induce 
them to own an equity share. This required return is the discount rate we use to 
value equity securities. 


Because we can consider a preferred stock’s fixed stream of dividends to be infinite, 
we can use the perpetuity formula to determine its value: 


D, 
preferred stock value i 


Pp 
where: 
Dp = dividend per period 


kp = the market’s required return on the preferred stock 


EXAMPLE: Preferred stock valuation 


A company’s $100 par preferred stock pays a $5.00 annual dividend and has a 
required return of 8%. Calculate the value of the preferred stock. 


Answer: 
Value of the preferred stock: D,,/Kp = $5.00/0.08 = $62.50 


Common stock is a residual claim to a company’s assets after it satisfies all other 
claims. Common stock typically does not promise a fixed dividend payment. Instead, 
the company’s management decides whether and when to pay common dividends. 


Because the future cash flows are uncertain, we must use models to estimate the 
value of common stock. Here, we will look at three approaches analysts use 
frequently, which we call dividend discount models (DDMs). We will return to 
these examples in the Equity Investments topic area and explain when each model is 
appropriate. 


1. Assume a constant future dividend. Under this assumption, we can value a common 
stock the same way we value a preferred stock, using the perpetuity formula. 

2. Assume a constant growth rate of dividends. With this assumption, we can apply 
the constant growth DDM, also known as the Gordon growth model. In this 
model, we state the value of acommon share as follows: 


D, 


Vv 
. k.- 8 


where: 

Vo = value of a share this period 

D = dividend expected to be paid next period 
ke = required return on common equity 

g- = constant growth rate of dividends 


In this model, Vg represents the PV of all the dividends in future periods, beginning with Dj. 
Note that k, must be greater than g, or the math will not work. 


EXAMPLE: Gordon growth model valuation 
Calculate the value of a stock that is expected to pay a $1.62 dividend next year, 
if dividends are expected to grow at 8% forever and the required return on 
equity is 12%. 
Answer: 

Calculate the stock’s value = D,/(K, — g.) 


= $1.62/(0.12 — 0.08) 
$40.50 


3. Assume a changing growth rate of dividends. This can be done in many ways. The 
example we will use here (and the one that is required for the Level I CFA exam) is 
known as a multistage DDM. Essentially, we assume a pattern of dividends in the 
short term, such as a period of high growth, followed by a constant growth rate of 
dividends in the long term. 

To use a multistage DDM, we discount the expected dividends in the short term as 
individual cash flows, then apply the constant growth DDM to the long term. As 
we Saw in the previous example, the constant growth DDM gives us a value for an 
equity share one period before the dividend we use in the numerator. Therefore, 
with a multistage DDM, we can apply the constant growth DDM to the first 
dividend we assume will grow at a constant rate. 


EXAMPLE: Multistage growth 


Consider a stock with dividends that are expected to grow at 15% per year for 
two years, after which they are expected to grow at 5% per year, indefinitely. 
The last dividend paid was $1.00, and k, = 11%. Calculate the value of this stock 


using the multistage growth model. 
Answer: 


Calculate the dividends over the high growth period: 
D, = Do(1 + g*) = 1.00(1.15) = $1.15 


D, = D,(1 + g*) = 1.15(1.15) = 1.15? = $1.32 


Although we increase D, by the high growth rate of 15% to get D2, D2 will grow 
at the constant growth rate of 5% for the foreseeable future. This property of D» 


allows us to use the constant growth model formula with D> to get P;, a time = 
1 value for all the (infinite) dividends expected from time = 2 onward: 

D2 1.32 
k.—8. 0.11 — 0.05 
Finally, we can sum the PVs of dividend 1 and of P, to get the PV of all the 
expected future dividends during both the high growth and constant growth 
periods: 


1.15 + 22.00 


P, 22.00 


PROFESSOR’S NOTE 

A key point to notice in this example is that when we applied the 
dividend in Period 2 to the constant growth model, it gave us a value for 
the stock in Period 1. To get a value for the stock today, we had to 
discount this value back by one period, along with the dividend in Period 
1 that was not included in the constant growth value. 


~~ 


= MODULE QUIZ 2.1 


1. Terry Corporation preferred stock is expected to pay a $9 annual dividend in 
perpetuity. If the required rate of return on an equivalent investment is 11%, one 
share of Terry preferred should be worth: 

A. $81.82. 
B. $99.00. 
C. $122.22. 


2. Dover Company wants to issue a $10 million face value of 10-year bonds with an 
annual coupon rate of 5%. If the investors’ required yield on Dover’s bonds is 6%, 
the amount the company will receive when it issues these bonds (ignoring 
transactions costs) will be: 

A. less than $10 million. 


B. equal to $10 million. 
C. greater than $10 million. 


MODULE 2.2: IMPLIED RETURNS AND C) 
CASH FLOW ADDITIVITY 


Video covering 
this content is 
available online. 


LOS 2.b: Calculate and interpret the implied return of fixed- 
income instruments and required return and implied growth of 
equity instruments given the present value (PV) and cash flows. 


The examples we have seen so far illustrate the relationships among present value, 
future cash flows, and the required rate of return. We can easily rearrange these 
relationships and solve for the required rate of return, given a security’s price and 
its future cash flows. 


EXAMPLE: Rate of return for a pure discount bond 


A zero-coupon bond with a face value of $1,000 will mature 15 years from today. 
The bond’s price is $650. Assuming annual compounding, what is the investor’s 


annualized return? 


Answer: 
$1,000 
a+an 
(1+) = aed = 1.5385 


r= 1.5385'/"5 — 1 = 0.0291 = 2.91% 


EXAMPLE: Yield of an annual coupon bond 


Consider the 10-year, $1,000 par value, 10% coupon, annual-pay bond we 
examined in an earlier example, when its price was $1,134.20 at a yield to 
maturity of 8%. What is its yield to maturity if its price decreases to $1,085.00? 


Answer: 
N = 10; PMT = 100; FV = 1,000; PV = -1,085; CPT I/Y = 8.6934 
The bond’s yield to maturity increased to 8.69%. 


Notice that the relationship between prices and yields is inverse. When the price 
decreases, the yield to maturity increases. When the price increases, the yield to 
maturity decreases. Or, equivalently, when the yield increases, the price decreases. 
When the yield decreases, the price increases. We will use this concept again and 
again when we study bonds in the Fixed Income topic area. 


In our examples for equity share values, we assumed the investor’s required rate of 
return. In practice, the required rate of return on equity is not directly observable. 
Instead, we use share prices that we can observe in the market to derive implied 
required rates of return on equity, given our assumptions about their future cash 
flows. 


For example, if we assume a constant rate of dividend growth, we can rearrange the 
constant growth DDM to solve for the required rate of return: 


Vv D, 
0 k= 8, 
D,; 
kK, &. Vo 
D, 
k, Vo + g. 


That is, the required rate of return on equity is the ratio of the expected dividend to 
the current price (which we refer to as a share’s dividend yield) plus the assumed 
constant growth rate. 


We can also rearrange the model to solve for a stock’s implied growth rate, given a 
required rate of return: 

1 

k, Vo + §. 

D, 

S. kK, Vo 


That is, the implied growth rate is the required rate of return minus the dividend 
yield. 


LOS 2.c: Explain the cash flow additivity principle, its importance for the no- 
arbitrage condition, and its use in calculating implied forward interest rates, 
forward exchange rates, and option values. 


The cash flow additivity principle refers to the fact that the PV of any stream of 
cash flows equals the sum of the PVs of the cash flows. If we have two series of cash 
flows, the sum of the PVs of the two series is the same as the PVs of the two series 
taken together, adding cash flows that will be paid at the same point in time. We can 
also divide up a series of cash flows any way we like, and the PV of the “pieces” will 
equal the PV of the original series. 


EXAMPLE: Cash flow additivity principle 


A security will make the following payments at the end of the next four years: 
$100, $100, $400, and $100. Calculate the PV of these cash flows using the 
concept of the PV of an annuity when the appropriate discount rate is 10%. 


Answer: 


We can divide the cash flows so that we have: 
f=] f=Z f=3 f=—4 
100 100 100 100 Cash flow series #1 
0 0 300 me. Cash flow series #2 


$100 $100 $400 $100 


The additivity principle tells us that to get the PV of the original series, we can 
just add the PVs of cash flow series #1 (a 4-period annuity) and cash flow series 
#2 (a single payment three periods from now). 

For the annuity: N = 4; PMT = 100; FV = 0; I/Y = 10; CPT > PV = -$316.99 

For the single payment: N = 3; PMT = 0; FV = 300; I/Y = 10; CPT > PV = -$225.39 
The sum of these two values is 316.99 + 225.39 = $542.38. 


The sum of these two (present) values is identical (except for rounding) to the 
sum of the present values of the payments of the original series: 

100 , 100 , 400 , 100 _ ¢549 3g 

lif 1h LY 


This is a simple example of replication. In effect, we created the equivalent of the 
given series of uneven cash flows by combining a 4-year annuity of 100 with a 3- 
year zero-coupon bond of 300. 


We rely on the cash flow additivity principle in many of the pricing models we see 
in the Level I CFA curriculum. It is the basis for the no-arbitrage principle, or “law 
of one price,’ which says that if two sets of future cash flows are identical under all 
conditions, they will have the same price today (or if they don’t, investors will 
quickly buy the lower-priced one and sell the higher-priced one, which will drive 
their prices together). 


Three examples of valuation based on the no-arbitrage condition are forward 
interest rates, forward exchange rates, and option pricing using a binomial model. 
We will explain each of these examples in greater detail when we address the related 
concepts in the Fixed Income, Economics, and Derivatives topic areas. For now, just 
focus on how they apply the principle that equivalent future cash flows must have 
the same present value. 


Forward Interest Rates 


A forward interest rate is the interest rate for a loan to be made at some future date. 
The notation used must identify both the length of the loan and when in the future 
the money will be borrowed. Thus, 1y1y is the rate for a 1-year loan to be made one 
year from now; 2y1y is the rate for a 1-year loan to be made two years from now; 
3y2y is the 2-year forward rate three years from now; and so on. 


By contrast, a spot interest rate is an interest rate for a loan to be made today. We 
will use the notation S, for a 1-year rate today, Sz for a 2-year rate today, and so on. 


The way the cash flow additivity principle applies here is that, for example, 
borrowing for three years at the 3-year spot rate, or borrowing for one-year periods 
in three successive years, should have the same cost today. This relation is 
illustrated as follows: (1 + S3)? = (1 + S,)(1 + lyly)(4 + 2y1y). 


In fact, any combination of spot and forward interest rates that cover the same time 
period should have the same cost. Using this idea, we can derive implied forward 
rates from spot rates that are observable in the fixed-income markets. 


EXAMPLE: Computing a forward rate from spot rates 

The 2-period spot rate, Sz, is 8%, and the 1-period spot rate, S;, is 4%. Calculate 
the forward rate for one period, one period from now, 1y1y. 

Answer: 


The following figure illustrates the problem. 
Finding a Forward Rate 
2-year bond (S, = 8.0%) 


1-year bond (today) 1-year bond 
(S, = 4.000%) (one year from today)(1y1y = ?) 


) 1 2 

From our original equality, (1 + S,)? = (1 + S,)(1 + 1y1y), we can get the following: 
(1+ S,) ee 
(+5) = (1 + lyly) 


Or, because we know that both choices have the same payoff in two years: 


(1.08)? = (1.04)(1 + lyly) 


(1.08)? 
1+ lyly) = 
(1 + lyly) = “7 oa) 
2 (1.08) 1.1664 > 1549 


In other words, investors are willing to accept 4.0% on the 1-year bond today 
(when they could get 8.0% on the 2-year bond today) only because they can get 
12.154% ona 1-year bond one year from today. This future rate that can be 
locked in today is a forward rate. 


Forward Currency Exchange Rates 


An exchange rate is the price of one country’s currency in terms of another 
country’s currency. For example, an exchange rate of 1.416 USD/EUR means that one 
euro (EUR) is worth 1.416 USS. dollars (USD). The Level I CFA curriculum refers to 
the currency in the numerator (USD, in this example) as the price currency and the 
one in the denominator (EUR in this example) as the base currency. 


Like interest rates, exchange rates can be quoted as spot rates for currency 
exchanges to be made today, or as forward rates for currency exchanges to be made 
at a future date. 


The percentage difference between forward and spot exchange rates is 
approximately the difference between the two countries’ interest rates. This is 
because there is an arbitrage trade with a riskless profit to be made when this 
relation does not hold. 


The possible arbitrage is as follows: borrow Currency A at Interest Rate A, convert it 
to Currency B at the spot rate and invest it to earn Interest Rate B, and sell the 
proceeds from this investment forward at the forward rate to turn it back into 
Currency A. If the forward rate does not correctly reflect the difference between 
interest rates, such an arbitrage could generate a profit to the extent that the return 
from investing Currency B and converting it back to Currency A with a forward 
contract is greater than the cost of borrowing Currency A for the period. 


For spot and forward rates expressed as price currency /base currency, the no- 
arbitrage relation is as follows: 


—e ( 1 + interest Tate vice en 


spot — (1+ interest rate, .....) 


This formula can be rearranged as necessary to solve for specific values of the 
relevant terms. 


EXAMPLE: Calculating the arbitrage-free forward exchange rate 


Consider two currencies, the ABE and the DUB. The spot ABE/DUB exchange rate 
is 4.5671, the 1-year riskless ABE rate is 5%, and the 1-year riskless DUB rate is 
3%. What is the 1-year forward exchange rate that will prevent arbitrage profits? 


Answer: 


Rearranging our formula, we have: 


1+I1 
forward = spot ead 
1+ Ipus 


and we can calculate the forward rate as: 


forward — 4.5671 (He) ~ 4.6558 ABE / DUB 


As you can see, the forward rate is greater than the spot rate by 4.6558 / 4.5671 - 


1 = 1.94%. This is approximately equal to the interest rate differential of 5% - 3% 
= 2%. 


Option Pricing Model 


An option is the right, but not the obligation, to buy or sell an asset on a future date 
for a specified price. The right to buy an asset is a call option, and the right to sell an 
asset is a put option. 


Valuing options is different from valuing other securities because the owner can let 
an option expire unexercised. A call option owner will let the option expire if the 
underlying asset can be bought in the market for less than the price specified in the 
option. A put option owner will let the option expire if the underlying asset can be 
sold in the market for more than the price specified in the option. In these cases, we 
say an option is out of the money. If an option is in the money on its expiration date, 
the owner has the right to buy the asset for less, or sell the asset for more, than its 
market price—and, therefore, will exercise the option. 


An approach to valuing options that we will use in the Derivatives topic area is a 
binomial model. A binomial model is based on the idea that, over the next period, 
some value will change to one of two possible values. To construct a one-period 
binomial model for pricing an option, we need the following: 

= A value for the underlying asset at the beginning of the period 


=» An exercise price for the option; the exercise price can be different from the value 
of the underlying, and we assume the option expires one period from now 


= Returns that will result from an up-move and a down-move in the value of the 
underlying over one period 


= The risk-free rate over the period 


As an example, we can model a call option with an exercise price of $55 ona stock 
that is currently valued (S,) at $50. Let us assume that in one period, the stock’s 


value will either increase (S¥) to $60 or decrease (S?) to $42. We state the return 


from an up-move (R") as $60 / $50 = 1.20, and the return from a down-move (R®) as 
$42 / $50 = 0.84. 


Figure 2.1: One Period Binomial Tree 


_» $50 1.20 = $60 


~™ $50 x 0.84 = $42 


The call option will be in the money after an up-move or out of the money after a 
down-move. Its value at expiration after an up-move, c}, is $60 - $55 = $5. Its value 
after a down-move, cf, is zero. 


Now, we can use no-arbitrage pricing to determine the initial value of the call option 
(Cg). We do this by creating a portfolio of the option and the underlying stock, such 
that the portfolio will have the same value following either an up-move (V¥) or a 
down-move (V$) in the stock. For our example, we would write the call option (that 
is, we grant someone else the option to buy the stock from us) and buy a number of 
shares of the stock that we will denote as h. We must solve for the / that results in 
Ve = ve: 
« The initial value of our portfolio, Vo, is hSg — Cg (we subtract cg because we are 
short the call option). 
I 
= The portfolio value after a down-move, V, is hS¢ — c@. 


= The portfolio value after an up-move, V¥, is BS} — c 


In our example, V¥ = h($60) — $5, and V? = h($42) — 0. Setting V? = V¢ and solving for h, 
we get the following: 


h($60) - $5 = h($42) 

h($60) - h($42) = $5 

h = $5 / ($60 - $42) = 0.278 
This result—the number of shares of the underlying we would buy for each call 
option we would write—is known as the hedge ratio for this option. 


With vy = Vv, the value of the portfolio after one period is known with certainty. 
This means we can say that either V? or V? must equal Vp compounded at the risk- 
free rate for one period. In this example, V¢ = 0.278($42) = $11.68, or 

V} = 0.278($00) — $5 = $11.68. Let us assume the risk-free rate over one period is 3%. 
Then, Vo = $11.68 / 1.03 = $11.34. 


Now, we can solve for the value of the call option, cp. Recall that Vg = hSg — cg, So Cp 
= hSo — Vo. Here, Co = 0.278($50) - $11.34 = $2.56. 


o 


= MODULE QUIZ 2.2 


1. For an equity share with a constant growth rate of dividends, we can estimate its: 
A. value as the next dividend discounted at the required rate of return. 
B. growth rate as the sum of its required rate of return and its dividend yield. 
C. required return as the sum of its constant growth rate and its dividend yield. 


2. An investment of €5 million today is expected to produce a one-time payoff of €7 
million three years from today. The annual return on this investment, assuming 
annual compounding, is closest to: 

A. 12%. 
B. 13%. 
C. 14%. 


KEY CONCEPTS 


LOS 2.a 


The value of a fixed-income instrument or an equity security is the present value of 
its future cash flows, discounted at the investor’s required rate of return: 


FV 


V- ae 


=FV(1+r1)° 


where: 
r = interest rate per compounding period 
t = number of compounding periods 


annuity payment — aaa 
where: 

r = interest rate per period 

t = number of periods 

PV = present value (principal) 


k _ Payment 
es 


The PV of a perpetual bond or a preferred stoc , where r = required rate of 


return. 


The PV of acommon stock with a constant growth rate of dividends is: 
D, 


a 
i. 


LOS 2.b 

By rearranging the present value relationship, we can calculate a security’s required 
rate of return based on its price and its future cash flows. The relationship between 
prices and required rates of return is inverse. 


For an equity share with a constant rate of dividend growth, we can estimate the 
required rate of return as the dividend yield plus the assumed constant growth rate, 
or we can estimate the implied growth rate as the required rate of return minus the 
dividend yield. 


LOS 2.¢e 


Using the cash flow additivity principle, we can divide up a series of cash flows any 
way we like, and the present value of the pieces will equal the present value of the 
original series. This principle is the basis for the no-arbitrage condition, under 
which two sets of future cash flows that are identical must have the same present 
value. 


ANSWER KEY FOR MODULE QUIZZES 


Module Quiz 2.1 


1.A 9/0.11 = $81.82 (LOS 2.a) 


2.A Because the required yield is greater than the coupon rate, the present value of 
the bonds is less than their face value: N = 10; I/Y = 6; PMT = 0.05 x $10,000,00 
= $500,000; FV = $10,000,000; and CPT PV = -$9,263,991. (LOS 2.a) 


Module Quiz 2.2 


1.C Using the constant growth dividend discount model, we can estimate the 


D 
required rate of return as k, a + g.. The estimated value of a share is all of 
0 


its future dividends discounted at the required rate of return, which simplifies 
D 
1 
to Vo =; 


if we assume a constant growth rate. We can estimate the 
e c 


constant growth rate as the required rate of return minus the dividend yield. 
(LOS 2.b) 


1 
2.A (2) 1 = 0.1187 


(LOS 2.b) 


READING 3 


STATISTICAL MEASURES OF 
ASSET RETURNS 


MODULE 3.1: CENTRAL TENDENCY AND a 
DISPERSION 


Video covering 
this content is 


LOS 3.a: Calculate, interpret, and evaluate measures of central —4vailable online. 
tendency and location to address an investment problem. 


Measures of Central Tendency 


Measures of central tendency identify the center, or average, of a dataset. This 
central point can then be used to represent the typical, or expected, value in the 
dataset. 


The arithmetic mean is the sum of the observation values divided by the number of 
observations. It is the most widely used measure of central tendency. An example of 
an arithmetic mean is a sample mean, which is the sum of all the values in a sample 
of a population, =X, divided by the number of observations in the sample, n. It is used 
to make inferences about the population mean. The sample mean is expressed as 
follows: 


2% 


X= a 


The median is the midpoint of a dataset, where the data are arranged in ascending 
or descending order. Half of the observations lie above the median, and half are 
below. To determine the median, arrange the data from the highest to lowest value, 
or lowest to highest value, and find the middle observation. 


The median is important because the arithmetic mean can be affected by outliers, 
which are extremely large or small values. When this occurs, the median is a better 
measure of central tendency than the mean because it is not affected by extreme 
values that may actually be the result of errors in the data. 


EXAMPLE: The median using an odd number of observations 


What is the median return for five portfolio managers with a 10-year annualized 
total returns record of 30%, 15%, 25%, 21%, and 23%? 


Answer: 


First, arrange the returns in descending order: 
30%, 25%, 23%, 21%, 15% 
Then, select the observation that has an equal number of observations above and 


below it—the one in the middle. For the given dataset, the third observation, 23%, 
is the median value. 


EXAMPLE: The median using an even number of observations 


Suppose we add a sixth manager to the previous example with a return of 28%. 
What is the median return? 


Answer: 
Arranging the returns in descending order gives us this: 
30%, 28%, 25%, 23%, 21%, 15% 


With an even number of observations, there is no single middle value. The median 
value, in this case, is the arithmetic mean of the two middle observations, 25% 
and 23%. Thus, the median return for the six managers is 24% = 0.5(25 + 23). 


The mode is the value that occurs most frequently in a dataset. A dataset may have 
more than one mode, or even no mode. When a distribution has one value that 
appears most frequently, it is said to be unimodal. When a dataset has two or three 
values that occur most frequently, it is said to be bimodal or trimodial, respectively. 


EXAMPLE: The mode 

What is the mode of the following dataset? 
Dataset: [30%, 28%, 25%, 23%, 28%, 15%, 5%] 
Answer: 


The mode is 28% because it is the value appearing most frequently. 


For continuous data, such as investment returns, we typically do not identify a 
single outcome as the mode. Instead, we divide the relevant range of outcomes into 
intervals, and we identify the modal interval as the one into which the largest 
number of observations fall. 


Methods for Dealing With Outliers 


In some cases, a researcher may decide that outliers should be excluded from a 
measure of central tendency. One technique for doing so is to use a trimmed mean. 
A trimmed mean excludes a stated percentage of the most extreme observations. A 
1% trimmed mean, for example, would discard the lowest 0.5% and the highest 0.5% 
of the observations. 


Another technique is to use a winsorized mean. Instead of discarding the highest 
and lowest observations, we substitute a value for them. To calculate a 90% 
winsorized mean, for example, we would determine the 5th and 95th percentile of 
the observations, substitute the 5th percentile for any values lower than that, 
substitute the 95th percentile for any values higher than that, and then calculate the 


mean of the revised dataset. Percentiles are measures of location, which we will 
address next. 


Measures of Location 

Quantile is the general term for a value at or below which a stated proportion of the 
data in a distribution lies. Examples of quantiles include the following: 

= Quartile. The distribution is divided into quarters. 

= Quintile. The distribution is divided into fifths. 

= Decile. The distribution is divided into tenths. 

= Percentile. The distribution is divided into hundredths (percentages). 

Note that any quantile may be expressed as a percentile. For example, the third 
quartile partitions the distribution at a value such that three-fourths, or 75%, of the 
observations fall below that value. Thus, the third quartile is the 75th percentile. The 


difference between the third quartile and the first quartile (25th percentile) is 
known as the interquartile range. 


To visualize a dataset based on quantiles, we can create a box and whisker plot, as 
shown in Figure 3.1. In a box and whisker plot, the box represents the central 
portion of the data, such as the interquartile range. The vertical line represents the 
entire range. In Figure 3.1, we can see that the largest observation is farther away 
from the center than is the smallest observation. This suggests that the data might 
include one or more outliers on the high side. 


Figure 3.1: Box and Whisker Plot 


100 ——____ = Targest observation SSCS 
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LOS 3.b: Calculate, interpret, and evaluate measures of dispersion to address 
an investment problem. 


Dispersion is defined as the variability around the central tendency. The common 
theme in finance and investments is the tradeoff between reward and variability, 


where the central tendency is the measure of the reward and dispersion is a measure 
of risk. 


The range is a relatively simple measure of variability, but when used with other 
measures, it provides useful information. The range is the distance between the 
largest and the smallest value in the dataset: 


range = maximum value - minimum value 


EXAMPLE: The range 


What is the range for the 5-year annualized total returns for five investment 
managers if the managers’ individual returns were 30%, 12%, 25%, 20%, and 
23%? 

Answer: 

range = 30 - 12 = 18% 


The mean absolute deviation (MAD) is the average of the absolute values of the 
deviations of individual observations from the arithmetic mean: 


ee 


re aici 


n 


The computation of the MAD uses the absolute values of each deviation from the 
mean because the sum of the actual deviations from the arithmetic mean is zero. 


EXAMPLE: MAD 


What is the MAD of the investment returns for the five managers discussed in the 
preceding example? How is it interpreted? 


Answer: 


Annualized returns: [30%, 12%, 25%, 20%, 23%] 
y [30+ 12 + 25 + 20 + 23] 


xX = 22% 

= = = = fi Boe, 
map = L!30 a2 4- 132 — 2214+ 15 rendOA at 22| + 123 — 221] 
map = 8+ 10+3+2+1)_ 494 
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This result can be interpreted to mean that, on average, an individual return will 
deviate +4.8% from the mean return of 22%. 


The sample variance, s*, is the measure of dispersion that applies when we are 
evaluating a sample of n observations from a population. The sample variance is 
calculated using the following formula: 


2%: — X) 
= n-1 


The denominator for s* is n- 1, one less than the sample size n. Based on the 
mathematical theory behind statistical procedures, the use of the entire number of 
sample observations, n, instead of n - 1 as the divisor in the computation of s*, will 


systematically underestimate the population variance—particularly for small 
sample sizes. This systematic underestimation causes the sample variance to be a 
biased estimator of the population variance. Using n - 1 instead of n in the 
denominator, however, improves the statistical properties of s* as an estimator of 
the population variance. 


EXAMPLE: Sample variance 


Assume that the 5-year annualized total returns for the five investment managers 
used in the preceding examples represent only a sample of the managers at a large 
investment firm. What is the sample variance of these returns? 


Answer: 
— [30+12+25+20+ 23 
ak al Hanae OS 
> [0 — 22)? + 2 — 22)? + (25 — 22)? + (20 — 22)? + (23 — 22)| 
a i i aie = a ai 
= 44.5(%) 


Thus, the sample variance of 44.5(%7) can be interpreted to be an unbiased 
estimator of the population variance. Note that 44.5 “percent squared” is 0.00445, 
and you will get this value if you put the percentage returns in decimal form [e.g., 
(0.30 - 0.22)?]. 


A major problem with using variance is the difficulty of interpreting it. The 
computed variance, unlike the mean, is in terms of squared units of measurement. 
How does one interpret squared percentages, squared dollars, or squared yen? This 
problem is mitigated through the use of the standard deviation. The units of standard 
deviation are the same as the units of the data (e.g., percentage return, dollars, 
euros). The sample standard deviation is the square root of the sample variance. 
The sample standard deviation, s, is calculated as follows: 


' n—-1 


EXAMPLE: Sample standard deviation 


Compute the sample standard deviation based on the result of the preceding 
example. 


Answer: 


Because the sample variance for the preceding example was computed to be 
44.5(%7), this is the sample standard deviation: 

s = [44.5(%7)]}/? = 6.67%, V0.00445 = 0.0667 
This means that on average, an individual return from the sample will deviate 


+6.67% from the mean return of 22%. The sample standard deviation can be 
interpreted as an unbiased estimator of the population standard deviation. 


A direct comparison between two or more measures of dispersion may be difficult. 
For instance, suppose you are comparing the annual returns distribution for retail 
stocks with a mean of 8% and an annual returns distribution for a real estate 


portfolio with a mean of 16%. A direct comparison between the dispersion of the 
two distributions is not meaningful because of the relatively large difference in their 
means. To make a meaningful comparison, a relative measure of dispersion must be 
used. Relative dispersion is the amount of variability in a distribution around a 
reference point or benchmark. Relative dispersion is commonly measured with the 
coefficient of variation (CV), which is computed as follows: 

cv — 2x — standard deviation of x 

= average value of x 

CV measures the amount of dispersion in a distribution relative to the distribution’s 
mean. This is useful because it enables us to compare dispersion across different sets 
of data. In an investments setting, the CV is used to measure the risk (variability) per 
unit of expected return (mean). A lower CV is better. 


EXAMPLE: Coefficient of variation 


You have just been presented with a report that indicates that the mean monthly 
return on T-bills is 0.25% with a standard deviation of 0.36%, and the mean 
monthly return for the S&P 500 is 1.09% with a standard deviation of 7.30%. Your 
manager has asked you to compute the CV for these two investments and to 
interpret your results. 


Answer: 
.36 
CV isms = 995 = 1-4 
7.30 
CV cap 500 = 1.09 — 6.70 


These results indicate that there is less dispersion (risk) per unit of monthly 
return for T-bills than for the S&P 500 (1.44 vs. 6.70). 


PROFESSOR’S NOTE 

To remember the formula for CV, remember that the CV is a measure of 
variation, so standard deviation goes in the numerator. CV is variation 
per unit of return. 


When we use variance or standard deviation as risk measures, we calculate risk 
based on outcomes both above and below the mean. In some situations, it may be 
more appropriate to consider only outcomes less than the mean (or some other 
specific value) in calculating a risk measure. In this case, we are measuring 
downside risk. 


One measure of downside risk is target downside deviation, which is also known 
as target semideviation. Calculating target downside deviation is similar to 
calculating standard deviation, but in this case, we choose a target value against 
which to measure each outcome and only include deviations from the target value in 
our calculation if the outcomes are below that target. 


The formula for target downside deviation is stated as follows: 


\ 2&8) 
at X. 


Starget n—1 


where B = the target 


Note that the denominator remains the sample size n minus one, even though we are 
not using all of the observations in the numerator. 


EXAMPLE: Target downside deviation 


Calculate the target downside deviation based on the data in the preceding 
examples, for a target return equal to the mean (22%), and for a target return of 
24%. 


Answer: 


Return Deviation From Deviation From 

Mean Target Return 
30% 30% — 22% = 8% 30% — 24% = 6% 
12% 12% — 22% =-10% 12% -— 24%=—-12% 
25% 25% — 22% = 3% 25% — 24% = 1% 
20% 20% — 22%=-2% 20% — 24% = —4% 
23% 23% — 22% = 1% 23% — 24% = —1% 

Som, = {ot CO 5.10% 


(—12)7 + (—4)? + (-1)? 


S540, — = — 6.34% 


“» MODULE QUIZ 3.1 
=h 
1. A dataset has 100 observations. Which of the following measures of central 
tendency will be calculated using a denominator of 100? 
A. The winsorized mean, but not the trimmed mean. 
B. Both the trimmed mean and the winsorized mean. 


C. Neither the trimmed mean nor the winsorized mean. 
2. XYZ Corp. Annual Stock Returns 


20X1 20X2 20X3 20X4 20X5 20X6 
22% 5% 7% 11% 2% 11% 


What is the sample standard deviation? 
A. 9.8%. 
B. 72.4%. 
C. 96.3%. 
3. XYZ Corp. Annual Stock Returns 
20X1 20X2 20X3 20X4 20X5 20X6 
22% 5% 7% 11% 2% 11% 


Assume an investor has a target return of 11% for XYZ stock. What is the stock’s 
target downside deviation? 

A. 9.389%. 

B. 12.10%. 

C. 14.80%. 


MODULE 3.2: SKEWNESS, KURTOSIS, >) 
AND CORRELATION 


Video covering 
LOS 3.c: Interpret and evaluate measures of skewness and ae poms 18 
kurtosis to address an investment problem. iaanevaicaiae 


A distribution is symmetrical if it is shaped identically on both sides of its mean. 
Distributional symmetry implies that intervals of losses and gains will exhibit the 
same frequency. For example, a symmetrical distribution with a mean return of zero 
will have losses in the -6% to -4% interval as frequently as it will have gains in the 
+4% to +6% interval. The extent to which a returns distribution is symmetrical is 
important because the degree of symmetry tells analysts if deviations from the 
mean are more likely to be positive or negative. 


Skewness, or skew, refers to the extent to which a distribution is not symmetrical. 
Nonsymmetrical distributions may be either positively or negatively skewed and 
result from the occurrence of outliers in the dataset. Outliers are observations 
extraordinarily far from the mean, either above or below: 


« A positively skewed distribution is characterized by outliers greater than the mean 
(in the upper region, or right tail). A positively skewed distribution is said to be 
skewed right because of its relatively long upper (right) tail. 


« A negatively skewed distribution has a disproportionately large amount of outliers 
less than the mean that fall within its lower (left) tail. A negatively skewed 
distribution is said to be skewed left because of its long lower tail. 


Skewness affects the location of the mean, median, and mode of a distribution: 


=» For asymmetrical distribution, the mean, median, and mode are equal. 


= Fora positively skewed, unimodal distribution, the mode is less than the median, 
which is less than the mean. The mean is affected by outliers; in a positively 
skewed distribution, there are large, positive outliers, which will tend to pull the 
mean upward, or more positive. An example of a positively skewed distribution is 
that of housing prices. Suppose you live in a neighborhood with 100 homes; 99 of 
them sell for $100,000, and one sells for $1,000,000. The median and the mode will 
be $100,000, but the mean will be $109,000. Hence, the mean has been pulled 
upward (to the right) by the existence of one home (outlier) in the neighborhood. 


= For a negatively skewed, unimodal distribution, the mean is less than the median, 
which is less than the mode. In this case, there are large, negative outliers that 
tend to pull the mean downward (to the left). 


PROFESSOR’S NOTE 

The key to remembering how measures of central tendency are affected 
by skewed data is to recognize that skew affects the mean more than the 
median and mode, and the mean is pulled in the direction of the skew. 
The relative location of the mean, median, and mode for different 
distribution shapes is shown in Figure 3.2. Note that the median is 
between the other two measures for positively or negatively skewed 
distributions. 


Figure 3.2: Effect of Skewness on Mean, Median, and Mode 


Symmetrical 
(Mean = Median = Mode) 


Mean 
Median 
Mode 
Positive (right) skew 
(Mean > Median > Mode) 


Mean 
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Mode 


Negative (left) skew 
(Mean < Median < Mode) 


Mean 
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Sample skewness is equal to the sum of the cubed deviations from the mean 
divided by the cubed standard deviation and by the number of observations. Sample 
skewness for large samples is approximated as follows: 


. LX -x)° 
sample skewness ~ (t) = >=— 


where: 


s = sample standard deviation 


PROFESSOR’S NOTE 
The LOS requires us to “interpret and evaluate” measures of skewness and 
kurtosis, but not to calculate them. 


Note that the denominator is always positive, but that the numerator can be positive 
or negative depending on whether observations above the mean or observations 
below the mean tend to be farther from the mean, on average. When a distribution is 
right skewed, sample skewness is positive because the deviations above the mean 
are larger, on average. A left-skewed distribution has a negative sample skewness. 


Dividing by standard deviation cubed standardizes the statistic and allows 
interpretation of the skewness measure. If relative skewness is equal to zero, the 
data are not skewed. Positive levels of relative skewness imply a positively skewed 


distribution, whereas negative values of relative skewness imply a negatively 
skewed distribution. Values of sample skewness in excess of 0.5 in absolute value are 
considered significant. 


Kurtosis is a measure of the degree to which a distribution is more or less peaked 
than a normal distribution. Leptokurtic describes a distribution that is more 
peaked than a normal distribution, whereas platykurtic refers to a distribution that 
is less peaked, or flatter than a normal one. A distribution is mesokurtic if it has the 
same kurtosis as a normal distribution. 


As indicated in Figure 3.3, a leptokurtic return distribution will have more returns 
clustered around the mean and more returns with large deviations from the mean 
(fatter tails). Relative to a normal distribution, a leptokurtic distribution will have a 
greater percentage of small deviations from the mean and a greater percentage of 
extremely large deviations from the mean. This means that there is a relatively 
greater probability of an observed value being either close to the mean or far from 
the mean. Regarding an investment returns distribution, a greater likelihood of a 
large deviation from the mean return is often perceived as an increase in risk. 


Figure 3.3: Kurtosis 
\e— Leptokurtic 


Normal Distribution 


a 


A distribution is said to exhibit excess kurtosis if it has either more or less kurtosis 
than the normal distribution. The computed kurtosis for all normal distributions is 
three. Statisticians, however, sometimes report excess kurtosis, which is defined as 
kurtosis minus three. Thus, a normal distribution has excess kurtosis equal to zero, a 
leptokurtic distribution has excess kurtosis greater than zero, and platykurtic 
distributions will have excess kurtosis less than zero. 


Kurtosis is critical in a risk management setting. Most research about the 
distribution of securities returns has shown that returns are not normally 
distributed. Actual securities returns tend to exhibit both skewness and kurtosis. 
Skewness and kurtosis are critical concepts for risk management because when 
securities returns are modeled using an assumed normal distribution, the 
predictions from the models will not take into account the potential for extremely 
large, negative outcomes. In fact, most risk managers put very little emphasis on the 
mean and standard deviation of a distribution and focus more on the distribution of 
returns in the tails of the distribution—that is where the risk is. In general, greater 
excess kurtosis and more negative skew in returns distributions indicate increased 
risk. 


Sample kurtosis for large samples is approximated using deviations raised to the 
fourth power: 


sample kurtosis ~ (2) i=l 


where: 
s = sample standard deviation 


LOS 3.d: Interpret correlation between two variables to address an 
investment problem. 


Scatter plots are a method for displaying the relationship between two variables. 
With one variable on the vertical axis and the other on the horizontal axis, their 
paired observations can each be plotted as a single point. For example, in Panel A of 
Figure 3.4, the point farthest to the upper right shows that when one of the variables 
(on the horizontal axis) equaled 9.2, the other variable (on the vertical axis) equaled 
8.5. 


The scatter plot in Panel A is typical of two variables that have no clear relationship. 
Panel B shows two variables that have a strong linear relationship—that is, a high 
correlation coefficient. 


A key advantage of creating scatter plots is that they can reveal nonlinear 
relationships, which are not described by the correlation coefficient. Panel C 
illustrates such a relationship. Although the correlation coefficient for these two 
variables is close to zero, their scatter plot shows clearly that they are related ina 
predictable way. 


Figure 3.4: Scatter Plots 


a) No relationship b) Strong linear relationship 
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Covariance is a measure of how two variables move together. The calculation of the 
sample covariance is based on the following formula: 


> {[Ki- ] 
t=1 


SxY n—1 


¥;,-Y}} 


where: 
X; = an observation of variable X 
Y. = an observation of variable Y 


X = mean of variable X 
Y = mean of variable Y 
n = number of periods 


In practice, the covariance is difficult to interpret. The value of covariance depends 
on the units of the variables. The covariance of daily price changes of two securities 
priced in yen will be much greater than their covariance if the securities are priced 
in dollars. Like the variance, the units of covariance are the square of the units used 
for the data. 


Additionally, we cannot interpret the relative strength of the relationship between 
two variables. Knowing that the covariance of X and Y is 0.8756 tells us only that 
they tend to move together because the covariance is positive. A standardized 
measure of the linear relationship between two variables is called the correlation 
coefficient, or simply correlation. The correlation between two variables, X and Y, is 


calculated as follows: 


p SxXY 
XY SySy’ 


which implies: 
“xy ™ Pxy*x*y 

The properties of the correlation of two random variables, X and Y, are summarized 

here: 

= Correlation measures the strength of the linear relationship between two random 
variables. 

= Correlation has no units. 

» The correlation ranges from —1 to +1. That is, -1 < pyy < +1. 

« If pyy = 1.0, the random variables have perfect positive correlation. This means 
that a movement in one random variable results in a proportional positive 
movement in the other relative to its mean. 

= If pyy =—-1.0, the random variables have perfect negative correlation. This means 
that a movement in one random variable results in an exact opposite proportional 
movement in the other relative to its mean. 

= If pyy =0, there is no linear relationship between the variables, indicating that 
prediction of Y cannot be made on the basis of X using linear methods. 


EXAMPLE: Correlation 


The variance of returns on Stock A is 0.0028, the variance of returns on Stock B 
is 0.0124, and their covariance of returns is 0.0058. Calculate and interpret the 
correlation of the returns for Stocks A and B. 


Answer: 


First, it is necessary to convert the variances to standard deviations: 
S, = (0.0028) = 0.0529 
se (0024) = Osa 


Now, the correlation between the returns of Stock A and Stock B can be 
computed as follows: 
0.0058 
PAB ~ (0.05290.1114) — 
The fact that this value is close to +1 indicates that the linear relationship is not 
only positive, but also is very strong. 


0.9842 


Care should be taken when drawing conclusions based on correlation. Causation is 
not implied just from significant correlation. Even if it were, which variable is 
causing change in the other is not revealed by correlation. It is more prudent to say 
that two variables exhibit positive (or negative) association, suggesting that the 
nature of any causal relationship is to be separately investigated or based on theory 
that can be subject to additional tests. 


One question that can be investigated is the role of outliers (extreme values) in the 
correlation of two variables. If removing the outliers significantly reduces the 
calculated correlation, further inquiry is necessary into whether the outliers 
provide information or are caused by noise (randomness) in the data used. 


Spurious correlation refers to correlation that is either the result of chance or 
present due to changes in both variables over time that is caused by their 
association with a third variable. For example, we can find instances where two 
variables that are both related to the inflation rate exhibit significant correlation, 
but for which causation in either direction is not present. 


In his book Spurious Correlation", Tyler Vigen presents the following examples. The 
correlation between the age of each year’s Miss America and the number of films 
Nicolas Cage appeared in that year is 87%. This seems a bit random. The correlation 
between the U.S. spending on science, space, and technology and suicides by hanging, 
strangulation, and suffocation over the 1999-2009 period is 99.87%. Impressive 
correlation, but both variables increased in an approximately linear fashion over the 
period. 


~ 


= MODULE QUIZ 3.2 


1. Which of the following is most accurate regarding a distribution of returns that 
has a mean greater than its median? 
A. It is positively skewed. 
B. It is a symmetric distribution. 
C. It has positive excess kurtosis. 


2. A distribution of returns that has a greater percentage of small deviations from 
the mean and a greater percentage of extremely large deviations from the mean 
compared with a normal distribution: 

A. is positively skewed. 
B. has positive excess kurtosis. 
C. has negative excess kurtosis. 


3. The correlation between two variables is +0.25. The most appropriate way to 
interpret this value is to say: 
A. a scatter plot of the two variables is likely to show a strong linear relationship. 
B. when one variable is above its mean, the other variable tends to be above its 
mean as well. 
C. a change in one of the variables usually causes the other variable to change in 
the same direction. 


KEY CONCEPTS 


LOS 3.a 


The arithmetic mean is the average of observations. The sample mean is the 
arithmetic mean of a sample: 


The median is the midpoint of a dataset when the data are arranged from largest to 
smallest. 


The mode of a dataset is the value that occurs most frequently. The modal interval is 
a measure of mode for continuous data. 


A trimmed mean omits outliers, and a winsorized mean replaces outliers with given 
values, reducing the effect of outliers on the mean in both cases. 


Quantile is the general term for a value at or below which lies a stated proportion of 
the data in a distribution. Examples of quantiles include the following: 

= Quartile. The distribution is divided into quarters. 

= Quintile. The distribution is divided into fifths. 

= Decile. The distribution is divided into tenths. 


» Percentile. The distribution is divided into hundredths (percentages). 
LOS 3.b 
The range is the difference between the largest and smallest values in a dataset. 


Mean absolute deviation (MAD) is the average of the absolute values of the 
deviations from the arithmetic mean: 


> IXi- x! 
MAD = =!) __ 


Variance is defined as the mean of the squared deviations from the arithmetic mean: 


- ) 
sample variance = s~ = 


where: 


X = sample mean 


n = sample size 


Standard deviation is the positive square root of the variance, and it is frequently 
used as a quantitative measure of risk. 
$ 


The coefficient of variation (CV) for sample data, CV = re is the ratio of the standard 


deviation of the sample to its mean. 


Target downside deviation or semideviation is a measure of downside risk: 


Starget n—1 


where B = the target 


LOS 3.c 

Skewness describes the degree to which a distribution is not symmetric about its 
mean. A right-skewed distribution has positive skewness. A left-skewed distribution 
has negative skewness. 


For a positively skewed, unimodal distribution, the mean is greater than the median, 
which is greater than the mode. For a negatively skewed, unimodal distribution, the 
mean is less than the median, which is less than the mode. 


Kurtosis measures the peakedness of a distribution and the probability of extreme 
outcomes (thickness of tails): 


» Excess kurtosis is measured relative to a normal distribution, which has a kurtosis 
of 3. 


= Positive values of excess kurtosis indicate a distribution that is leptokurtic (fat 
tails, more peaked), so the probability of extreme outcomes is greater than for a 
normal distribution. 


= Negative values of excess kurtosis indicate a platykurtic distribution (thin tails, 
less peaked). 
LOS 3.d 


Correlation is a standardized measure of association between two random variables. 


: : Cov 
It ranges in value from —1 to +1 and is equal to Ova, B 
B 


T4959 


Scatter plots are useful for revealing nonlinear relationships that are not measured 
by correlation. 


Correlation does not imply that changes in one variable cause changes in the other. 
Spurious correlation may result by chance, or from the relationships of two 
variables to a third variable. 


ANSWER KEY FOR MODULE QUIZZES 


Module Quiz 3.1 


1.A The winsorized mean substitutes a value for some of the largest and smallest 
observations. The trimmed mean removes some of the largest and smallest 
observations. (LOS 3.a) 


2.A The sample mean is [22% + 5% + -7% + 11% + 2% + 11%] / 6 = 7.3%. The 
sample standard deviation is the square root of the sample variance: 


_ li — 7.3)? + (5 — 7.3)? + (~7 — 7.3)? + (11 — 7.3)? + (2 — 7.3)? + (11 — 7.37 
6-1 
V96.3 = 9.8%. 
(LOS 3.b) 


3.A Here are deviations from the target return: 
22% — 11% = 11% 
5% — 11% = —6% 
—7% — 11% = —18% 
11% — 11% = 0% 
2% — 11% = —% 
11% — 11% = 0% 


Target downside deviation = 


poet C18 + COV _ REZ = 9.39%. 


(LOS 3.b) 
Module Quiz 3.2 


1.A A distribution with a mean greater than its median is positively skewed, or 
skewed to the right. The skew pulls the mean. Kurtosis deals with the overall 
shape of a distribution, not its skewness. (LOS 3.c) 


2.B A distribution that has a greater percentage of small deviations from the mean 
and a greater percentage of extremely large deviations from the mean will be 
leptokurtic and will exhibit excess kurtosis (positive). The distribution will be 
more peaked and have fatter tails than a normal distribution. (LOS 3.c) 


3.B A correlation of +0.25 indicates a positive linear relationship between the 
variables—one tends to be above its mean when the other is above its mean. 
The value 0.25 indicates that the linear relationship is not particularly strong. 
Correlation does not imply causation. (LOS 3.d) 


1. “Spurious Correlations,” Tyler Vigen, www.tylervigen.com 


READING 4 


PROBABILITY TREES AND 
CONDITIONAL EXPECTATIONS 


MODULE 4.1: PROBABILITY MODELS, a 
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FORMUL A Video covering 
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LOS 4.a: Calculate expected values, variances, and standard 
deviations and demonstrate their application to investment problems. 


The expected value of a random variable is the weighted average of the possible 
outcomes for the variable. The mathematical representation for the expected value 
of random variable X, that can take on any of the values from x, to x,, is: 


E(X) = UP(x;)x; = P(X;)X; + P(x,)x,+ sina P(x,)x, 
where: 
P(x;) = probability of outcome x; 


EXAMPLE: Expected earnings per share 


The probability distribution of earnings per share (EPS) for Ron’s Stores is given 
in the following figure. Calculate the expected EPS. 


EPS Probability Distribution 
Probability EPS 


10% £1.80 
20% £1.60 
40% £1.20 
30% £1.00 
100% 
Answer: 


The expected EPS is simply a weighted average of each possible EPS, where the 
weights are the probabilities of each possible outcome: 


E[EPS] = 0.10(1.80) + 0.20(1.60) + 0.40(1.20) + 0.30(1.00) = £1.28 


Variance and standard deviation measure the dispersion of a random variable 
around its expected value, sometimes referred to as the volatility of a random 


variable. Variance (from a probability model) can be calculated as the probability- 
weighted sum of the squared deviations from the mean (or expected value). The 
standard deviation is the positive square root of the variance. The following example 
illustrates the calculations for a probability model of possible returns. 


EXAMPLE: Expected value, variance, and standard deviation from a 
probability model 


Using the probabilities given in the following table, calculate the expected return 
on Stock A, the variance of returns on Stock A, and the standard deviation of 
returns on Stock A. 


Event |Prob| Ra | Prob x Ry {Ry — E(Ry)/[Ry — ER,)P| Prob x [Ry 


—ER,)/ 
Boom 0.00147 
Normal |50% 0.00005 
Slow 0.00128 
Var (R = 
0.00280 
Answer: 


E(Ra) = (0.30 x 0.20) + (0.50 x 0.12) + (0.20 x 0.05) = 0.13 = 13% 


The expected return for Stock A is the probability-weighted sum of the returns 
under the three different economic scenarios. 


In Column 5, we have calculated the differences between the returns under each 
economic scenario and the expected return of 13%. 


In Column 6, we squared all the differences from Column 5. In the final column, 
we have multiplied the probabilities of each economic scenario times the squared 
deviation of returns from the expected returns, and their sum, 0.00280, is the 
variance of Rg. 


The standard deviation of Ry = y0.0028 = 0.0529. 


Note that in a previous reading, we estimated the standard deviation of a 
distribution from sample data, rather than from a probability model of returns. For 
the sample standard deviation, we divided the sum of the squared deviations from 
the mean by n - 1, where n was the size of the sample. Here, we have no “n” because 
we have no observations; a probability model is forward-looking. We use the 
probability weights instead, as they describe the entire distribution of outcomes. 


LOS 4.b: Formulate an investment problem as a probability tree and explain 
the use of conditional expectations in investment application. 


You may wonder where the returns and probabilities used in calculating expected 
values come from. A general framework, called a probability tree, is used to show 
the probabilities of various outcomes. In Figure 4.1, we have shown estimates of EPS 
for four different events: (1) a good economy and relatively good results at the 


company, (2) a good economy and relatively poor results at the company, (3) a poor 
economy and relatively good results at the company, and (4) a poor economy and 
relatively poor results at the company. Using the rules of probability, we can 
calculate the probabilities of each of the four EPS outcomes shown in the boxes on 
the right-hand side of the probability tree. 
The expected EPS of $1.51 is simply calculated as follows: 

(0.18 x 1.80) + (0.42 x 1.70) + (0.24 x 1.30) + (0.16 x 1.00) = $1.51 
Note that the probabilities of the four possible outcomes sum to 1. 


Figure 4.1: A Probability Tree 


EPS = $1.80 
Prob = (60% =x 30%) 
= 18% 


Prob. of 
good results 
= 30% 


Prob. of 
good economy 


Expected EPS = _— Prob. of poor 
(18% = $1.80) + results = 70% 
(42% x $1.70) + 
(24% x $1.30) + 
(16% x $1.00) Prob. of good 
= $1.51 ae results = 60% 


poor economy 
= 40% 


EPS = $1.70 
Prob = (60% = 70%) 
= 42% 


EPS = $1.30 
Prob = (40% =x 60%) 
= 24% 


Prob. of 
poor results 
= 40% 


EPS =$1.00 
Prob = (40% =x 40%) 
= 16% 


Expected values or expected returns can be calculated using conditional 
probabilities. As the name implies, conditional expected values are contingent on 
the outcome of some other event. An analyst would use a conditional expected value 
to revise his expectations when new information arrives. 


Consider the effect a tariff on steel imports might have on the returns of a domestic 
steel producer’s stock in the previous example. The stock’s conditional expected 
return, given that the government imposes the tariff, will be higher than the 
conditional expected return if the tariff is not imposed. 


LOS 4.c: Calculate and interpret an updated probability in an investment 
setting using Bayes’ formula. 


Bayes’ formula is used to update a given set of prior probabilities for a given event 
in response to the arrival of new information. The rule for updating prior probability 
of an event is as follows: 


updated probability = 


probability of new information for a given event 


unconditional probability of new information ee 


We can derive Bayes’ formula using the multiplication rule and noting that P(AB) = 
P(BA): 


P(B| A) x P(A) = P(BA), and P(A|B) x P(B) = P(AB) 


Because P(BA) = P(AB), we can write P(B| A) P(A) = P(A|B) P(B), 
(A) 
and rele equals oS the joint probability of A and B divided by the 


unconditional probability of B. 
The following example illustrates the use of Bayes’ formula. Note that A is 
outperform and A° is underperform, P(BA) is (outperform + gains), P(ASB) is 
(underperform + gains), and the unconditional probability P(B) is P(AB) + P(A°B), by 
the total probability rule. 


EXAMPLE: Bayes’ formula 


There is a 60% probability that the economy will outperform—and if it does, 
there is a 70% probability a stock will go up and a 30% probability the stock will 
go down. There is a 40% probability the economy will underperform, and if it 
does, there is a 20% probability the stock in question will increase in value (have 
gains) and an 80% probability it will not. Given that the stock increased in value, 
calculate the probability that the economy outperformed. 


Answer: 


42% (outperform + gains) 


18% (outperform +no gains) 


8% (underperform + gains) 


32% (underperform + no gains) 


In the figure just presented, we have multiplied the probabilities to calculate the 
probabilities of each of the four outcome pairs. Note that these sum to 1. Given 
that the stock has gains, what is our updated probability of an outperforming 
economy? We sum the probability of stock gains in both states (outperform and 
underperform) to get 42% + 8% = 50%. Given that the stock has gains and using 


Bayes’ formula, the probability that the economy has outperformed is — = 84%. 


=I MODULE QUIZ 4.1 


1. Given the conditional probabilities in the following table and the unconditional 
probabilities P(Y = 1) = 0.8 and P(Y = 2) = 0.7, what is the expected value of X? 


x P(x; | ¥=1) P(x; | Y=2) 


0 0.2 0.1 
5 0.4 0.8 
10 0.4 0.1 
A. 5.0. 
B. 5.3. 


2. An analyst believes that Davies Company has a 40% probability of earning more 
than $2 per share. She estimates that the probability that Davies Company’s 
credit rating will be upgraded is 70% if its earnings per share (EPS) are greater 
than $2, and 20% if its EPS are $2 or less. Given the information that Davies 
Company’s credit rating has been upgraded, what is the updated probability that 
its EPS are greater than $2? 

A. 50%. 
B. 60%. 
C. 70%. 


KEY CONCEPTS 


LOS 4.a 
The expected value of a random variable is the weighted average of its possible 
outcomes: 

E(X) = 2P(xj)xj = P(x1)x1 + P(x2)x2 + ... + P(x) Xp 


Variance can be calculated as the probability-weighted sum of the squared 
deviations from the mean or expected value. The standard deviation is the positive 
square root of the variance. 


LOS 4.b 


A probability tree shows the probabilities of two events and the conditional 
probabilities of two subsequent events: 


Conditional expected values depend on the outcome of some other event. Forecasts 
of expected values for a stock’s return, earnings, and dividends can be refined, using 
conditional expected values, when new information arrives that affects the expected 
outcome. 


LOS 4.e 


Bayes’ formula for updating probabilities based on the occurrence of an event O is as 
follows: 


P(O|I) © 


P(I| O) = P(O) x P@ 


P(AC) 


Equivalently, based on the following tree diagram, P(A|C) = PAC) + PO): 


AC with Prob (AC) 


AD with Prob (AD) 


BC with Prob (BC) 


BD with Prob (BD) 


ANSWER KEY FOR MODULE QUIZZES 


Module Quiz 4.1 


1.B E(X|Y = 1) = (0.2)(0) + (0.4)(5) + (0.4)(10) = 6 
E(X | Y = 2) = (0.1)(0) + (0.8)(5) + (0.1)(10) = 5 
E(X) = (0.3)(6) + (0.7)(5) = 5.30 
(LOS 4.a) 


2.C This is an application of Bayes’ formula. As the following tree diagram shows, 


oe : 28% _ : 
the updated probability that EPS are greater than $2 is 78%, 4 12% > 70%: 


28% upgrade and EPS > $2 
§2 
EPS” 
12% no upgrade and EPS > $2 
0% &e 12% upgrade and EPS < $2 
S$ 


48% no upgrade and EPS < $2 
(LOS 4.c) 


READING 5 
PORTFOLIO MATHEMATICS 


MODULE 5.1: PROBABILITY MODELS G 
FOR PORTFOLIO RETURN AND RISK 


Video covering 
this content is 


LOS 5.a: Calculate and interpret the expected value, variance, available online. 
standard deviation, covariances, and correlations of portfolio 
returns. 


The expected return of a portfolio composed of n assets with weights, w,, and 
expected returns, R;, can be determined using the following formula: 


E(Rp) = dw, ER,) = w,E(R,) + w,E(R,) + ... + Ww, E(R,) 
i=1 


The expected return and variance for a portfolio of assets can be determined using 
the properties of the individual assets in the portfolio. To do this, it is necessary to 
establish the portfolio weight for each asset. As indicated in the formula below, the 
weight, w, of Asset i is simply the market value currently invested in the asset 
divided by the current market value of the entire portfolio: 

__. market value of investment in Asset i 

: market value of the portfolio 
In many finance situations, we are interested in how two random variables move in 
relation to each other. For investment applications, one of the most frequently 
analyzed pairs of random variables is the returns of two assets. Investors and 
managers frequently ask questions such as, “What is the relationship between the 
return for Stock A and Stock B?” or, “What is the relationship between the 
performance of the S&P 500 and that of the automotive industry?” 


Covariance is a measure of how two assets move together. It is the expected value 
of the product of the deviations of the two random variables from their respective 
expected values. A common symbol for the covariance between random variables X 
and Y is Cov(X,Y). Because we will be mostly concerned with the covariance of asset 
returns, the following formula has been written in terms of the covariance of the 
return of Asset i, R;, and the return of Asset j, R;: 


Cov(R;,R,) E{[R; — Ey] [R; = E(R; )]} 


The following are properties of covariance: 


» The covariance of a random variable with itself is its variance; that is, Cov(R,,Ra) 
= Var(Ra). 

» Covariance may range from negative infinity to positive infinity. 

« A positive covariance indicates that when one random variable is above its mean, 
the other random variable also tends to be above its mean. 

» A negative covariance indicates that when one random variable is above its mean, 
the other random variable tends to be below its mean. 

The sample covariance for a sample of returns data can be calculated as follows: 

na 


5 ((8ss-F Rss%5)} 


SKY n—1 
where: 
R an observation of returns on Asset 1 


1i> 
Ri = an observation of returns on Asset 2 
R, = mean return of Asset 1 

R, = mean of Asset 2 

n = number of observations in the sample 


A covariance matrix shows the covariances between returns on a group of assets. 


Figure 5.1: Covariance Matrix for Assets A, B, and C 


Asset A B Cc 
A Cov(R,,R,) Covw(R,,R,) Cov(R,,R,) 
B Cov(Rz,R,) Cov(Rp,Rp) Cov(Rp,R,) 


c Cov(Rg,R,) Cov(RgRs) Cov(R,Ro) 


Note that the diagonal terms from top left are the variances of each asset’s returns— 
in other words, Cov(R,,Ra) = Var(Ra). 


The covariance between the returns on two assets does not depend on order—in 
other words, Cov(R,,Rpg) = Cov(Rp,Ra)—so in this covariance matrix, only three of 


the (off-diagonal) covariance terms are unique. In general for n assets, there are n 
variance terms (on the diagonal) and n(n — 1) / 2 unique covariance terms. 


With portfolio variance, to calculate the variance of portfolio returns, we use the 
asset weights, returns variances, and returns covariances: 


NN 
Var(Rp) = 2 PyMirjCorRi »R)) 
i=lj=1 


The variance of a portfolio composed of two risky assets, A and B, can be expressed 
as follows: 


Var(R ») = w,w,Cov(R,,R,) + W,Wp Cov(R,,Rp) + wgw, Cov(Rp,R,) + 
WW, Cov(R,,Rz) 


We can write this more simply as: 


Var(R,) = wi Var(R,) + wy Var(Rg) + 2w, w_Cov(R, Rg) 


or: 

of = wo, + weap + 2w, wRCovas 
For a three-asset portfolio, the portfolio variance is: 

of = wick + woo} + waod + 2w,weCovyg+ 2wy WoCOVaG+ 

2Wp Wc COVac 

Consider a portfolio with three assets: an index of domestic stocks (60%), an index 
of domestic bonds (30%), and an index of international equities (10%). A covariance 
matrix of the three assets is shown here. 


Figure 5.2: Covariance Matrix for the Three Assets 


Asset Domestic Domestic International 
Stocks Bonds Equities 
Domestic stocks 400 ae 180 
Domestic bonds ae 70 35 
International equities 180 35 450 


Portfolio returns variance = 


(0.62)400 + (0.32)70 + (0.17)450 + 2(0.6)(0.3)44 + 2(0.6)(0.1)180 + 
2(0.3)(0.1)35 = 194.34 


Portfolio returns standard deviation = — 194.34 = 13.94%. 


Note that the units of variance and covariance are %? (i.e., 0.001). When we put 
these values in as whole numbers (in %7), the portfolio variance is in %?, and the 
standard deviation is in whole percentages. We could also put variance and 
covariance in as decimals and get both the portfolio returns variance and standard 
deviation as decimals. 


From the formula for portfolio returns variance, we can see that the lower the 
covariance terms, the lower the portfolio variance (and standard deviation). This is 
true for positive values of covariance, as well as negative values. 


Recall that the correlation coefficient for two variables is: 
Covys 
PaAB= GR” so that (Cov,,) = pAgXo,op 
This can be substituted for Cova, in our formula for portfolio returns variance. With 
this substitution, we can use a correlation matrix to calculate portfolio returns 
variance, rather than using covariances. 


Figure 5.3: Correlation Matrix for the Three Assets 


Asset Domestic Domestic International 
Stocks Bonds Equities 

Domestic stocks 1.000 0.263 0.424 

Domestic bonds 0.263 1.000 0.197 


International equities 0.424 0.197 1.000 


Note that the correlations of asset returns with themselves (the diagonal terms) are 
all 1. 


LOS 5.b: Calculate and interpret the covariance and correlation of portfolio 
returns using a joint probability function for returns. 


EXAMPLE: Covariance of returns from a joint probability function 


Assume that the economy can be in three possible states (S) next year: boom, 
normal, or slow economic growth. An expert source has calculated that P(boom) 
= 0.30, P(normal) = 0.50, and P(slow) = 0.20. The returns for Asset A, Ra, and Asset 


B, Rp, under each of the economic states are provided in the probability model as 
follows. What is the covariance of the returns for Asset A and Asset B? 


Joint Probability Function 
Rg=30% Rg=10%  Rg=0% 


R,=20% 0.30 0 0 
R,=12% 0 0.50 0 
Ry =5% 0 0 0.20 


The table gives us the joint probability of returns on Assets A and B (e.g,, there is a 
30% probability that the return on Asset A is 20% and the return on Asset B is 
30%, and there is a 50% probability that the return on Asset A is 12% and the 
return on Asset B is 10%). 


Answer: 

First, we must calculate the expected returns for each of the assets: 
E(Ra) = (0.3)(0.20) + (0.5)(0.12) + (0.2)(0.05) = 0.13 
E(Rg) = (0.3)(0.30) + (0.5)(0.10) + (0.2)(0.00) = 0.14 


The covariance can now be computed using the procedure described in the 
following table. 


Covariance Calculation 


Probability Ry Rg Probability x [Ry — E(R,)] x [Rg — ER) 
0.3 0.20 0.30 — (0.30.2 — 0.13)(0.3 — 0.14) = 0.00336 
0.5 0.12 0.10 (0.50.12 — 0.13)(0.1 — 0.14) = 0.00020 
0.2 0.05 0.00 —_(0.2(0.05 — 0.130 — 0.14) = 0.00224 


The covariance of returns for Asset A and Asset B is 0.00336 + 0.00020 + 0.00224 
= 0.005784. 


LOS 5.c: Define shortfall risk, calculate the safety-first ratio, and identify an 
optimal portfolio using Roy’s safety-first criterion. 


Shortfall risk is the probability that a portfolio value or return will fall below a 
particular target value or return over a given period. 


Roy’s safety-first criterion states that the optimal portfolio minimizes the 
probability that the return of the portfolio falls below some minimum acceptable 
level. This minimum acceptable level is called the threshold level. Symbolically, 
Roy’s safety-first criterion can be stated as follows: 


minimize P(Rp < Ry) 
where: 
Rp = portfolio return 


Ry, = threshold level return 


If portfolio returns are normally distributed, then Roy’s safety-first criterion can be 
stated as follows: 
E(Rp)— Ry 


maximize safety-first ratio, which equals op 


The reasoning behind the safety-first criterion is illustrated in Figure 5.4. Assume an 
investor is choosing between two portfolios: Portfolio A, with an expected return of 
12% and standard deviation of returns of 18%, and Portfolio B, with an expected 
return of 10% and standard deviation of returns of 12%. The investor has stated that 
he wants to minimize the probability of losing money (negative returns). Assuming 
that returns are normally distributed, the portfolio with the larger safety-first ratio 
using 0% as the threshold return (R,) will be the one with the lower probability of 


negative returns. 
Figure 5.4: The Safety-First Criterion and Shortfall Risk 
A. Normally Distributed Returns 
Portfolio A: E(R) = 12% 6, = 18% Portfolio B: E(R) = 10% 0,= 12% 


Probability of returns < 0% 
— i.e. short fall risk 


\ 


iZ= 10-0 
SFR, = + = 0.667 SFR, = —~— = 0.883 


B. Standard Normal 


25.14% 20.33% 


-0.67 0 -0.883 0 


Panel B of Figure 5.4 relates the safety-first ratio to the standard normal 
distribution. Note that the safety-first ratio is the number of standard deviations 
below the mean. Thus, the portfolio with the larger safety-first ratio has the lower 
probability of returns below the threshold return, which is a return of 0% in our 
example. Using a z-table for negative values, we can find the probabilities in the left- 
hand tails as indicated. These probabilities (25% for Portfolio A and 20% for 


Portfolio B) are also the shortfall risk for a target return of 0%—that is, the 
probability of negative returns. Portfolio B has the higher safety-first ratio, which 
means it has the lower probability of negative returns. 


In summary, when choosing among portfolios with normally distributed returns 
using Roy’s safety-first criterion, there are two steps: 
. E(R,) — Ry 
Step 1: Calculate the safety-first ratio . —_*-__— 
P 


Step 2: Choose the portfolio that has the largest safety-first ratio. 


EXAMPLE: Roy’s safety-first criterion 


For the next year, the managers of a $120 million college endowment plan have 
set a minimum acceptable end-of-year portfolio value of $123.6 million. Three 
portfolios are being considered that have the expected returns and standard 
deviation shown in the first two rows of the following table. Determine which of 
these portfolios is the most desirable using Roy’s safety-first criterion and the 
probability that the portfolio value will fall short of the target amount. 


Answer: 


The threshold return is R;, = (123.6 - 120) / 120 = 0.030 = 3%. The safety-first 


ratios are shown in the following table. As indicated, the best choice is Portfolio A 
because it has the largest safety-first ratio. 


Roy’s Safety-First Ratios 


E(R,) 9% 11% 6.6% 

a, 12% 20% 8.2% 

SFRatio (9-3) (11—3) (6.6—3) 
O5=S5 0455" 0442995 


The probability of an ending value for Portfolio A less than $123.6 million (a 
return less than 3%) is simply F(-0.5), which we can find on the z-table for 
negative values. The probability is 0.3085 = 30.85%. 


MODULE QUIZ 5.1 
v— 
1. The correlation of returns between Stocks A and B is 0.50. The covariance between 
these two securities is 0.0043, and the standard deviation of the return of Stock B 
is 26%. The variance of returns for Stock A is: 
A. 0.0011. 
B. 0.03381. 


C. 0.2656. 
2. Given the following joint probability table for the returns on Assets P and Q: 
Q=7% Q=4% Q=0% 


P=15% 0.2 0 0 
P= 12% 0 0.2 0 
P—0% 0 0 0.6 


The covariance between P and Q is closest to: 


A. 18.0. 


B. 18.7. 
C. 19.3. 
3. Expected returns and standard deviations of returns for three portfolios are as 
follows: 
Portfolio A PortfolioB Portfolio C 
E(R,) 5% 11% 18% 
o 8% 21% 40% 


p 


Given a threshold level of return of 4%, the optimal portfolio using Roy’s safety- 
first criterion is: 


A. Portfolio A. 
B. Portfolio B. 
C. Portfolio C. 


KEY CONCEPTS 


LOS 5.a 
The expected return of a portfolio composed of n assets with weights, w;, and 
expected returns, R;, is: 


a 
E(Rp) = DwiER) = w,E(R;) + w2E(R;) + ... + w, E(R,) 
The variance of a portfolio composed of two risky assets, A and B, can be expressed 
as follows: 
of = Wyo, + weog + 2w, WRCovan 


where Cova is the expected value of the product of the deviations of the two 
assets’ returns from their respective expected values. 


The variance of a two-asset portfolio can also be expressed as follows: 
Var(Rp) = wi.o4 + weog t+ 2W,Wp5,TpPap 
where p A,B is the correlation of the two assets’ returns. 


LOS 5.b 
Given the joint probabilities for A; and B;, the covariance is calculated as follows: 


YP(A,B,)[A; — ECA)] [B, — EB) 
i=1 


LOS 5.c 
Shortfall risk is the probability that a portfolio’s value (or return) will fall below a 
specific value over a given period. 


The safety-first ratio for Portfolio P, based on a target return Ry, is: 


—R 
SFRatio = ais 


Greater safety-first ratios are preferred and indicate a smaller shortfall probability. 
Roy’s safety-first criterion states that the optimal portfolio minimizes shortfall risk. 


ANSWER KEY FOR MODULE QUIZZES 


Module Quiz 5.1 


CovR, Rs) 
PA Coma s) = Fay oRe)] 
2 Cov(Ry »Rg) 
oRa) = ao(Rg)Corr(R, ,Rp) 
0.0043 ]7 ; 
a lesexes) ~ 0.03312 = 0.0011 
(LOS 5.a) 


2.B Expected value of P = 0.2 x 15+0.2 x 12 +0.6x0=54% 
Expected value of Q=0.2 x 7+0.2x4+0.6x0=2.2% 


Covariance = 0.2 x (15 - 5.4) x (7 - 2.2) + 0.2 x (12 - 5.4) x (4- 2.2) + 0.6 (0 - 
5.4) (0 - 2.2) = 18.72 
(LOS 5.b) 
3.C Safety-first ratio for Portfolio A = (5 - 4) /8=0.125 
Safety-first ratio for Portfolio B = (11 - 4) / 21 = 0.300 
Safety-first ratio for Portfolio C = (18 - 4) / 40 = 0.350 


The largest value is 0.35, so Portfolio C has the smallest probability of a return 
below the threshold. (LOS 5.c) 


READING 6 
SIMULATION METHODS 


MODULE 6.1: LOGNORMAL Go 
DISTRIBUTIONS AND SIMULATION 
TR C HNIQUES Video covering 


this content is 
available online. 
LOS 6.a: Explain the relationship between normal and 
lognormal distributions and why the lognormal distribution is used to model 
asset prices when using continuously compounded asset returns. 


The lognormal distribution is generated by the function e*, where x is normally 


distributed. Because the natural logarithm, /n, of e* is x, the logarithms of 
lognormally distributed random variables are normally distributed, thus the name. 


Figure 6.1 illustrates the differences between a normal distribution and a lognormal 
distribution. 


Figure 6.1: Normal vs. Lognormal Distributions 


Normal Distribution Lognormal Distribution 


H 0 


Recall from our reading about rates and returns that we use the natural logarithm to 
calculate continuously compounded returns. The lognormal distribution is useful for 
modeling asset prices if we think of an asset’s future price as the result of a 
continuously compounded return on its current price. That is: 


Pr = Py e7o,T 


where: 
Py = asset price at time T 
Po = asset price at time 0 (today) 


rg,T = continuously compounded return on the asset from time 0 to time T 


Because continuously compounded returns are additive, we can divide the time from 
0 to T into shorter periods and state that rg 7 is the sum of the continuously 
compounded returns over each of these shorter periods. Then, if we assume each of 
these returns is normally distributed, we can state that rg 7 is normally distributed. 
Even if they are not normally distributed, the central limit theorem implies that 
their sum (rp 7) is approximately normally distributed. This allows us to say Pry is 
lognormally distributed, because it is proportional to the logarithm of a normally 
distributed variable. 


In many of the pricing models that we will see in the CFA curriculum, we assume 
returns are independently and identically distributed. If returns are 
independently distributed, past returns are not useful for predicting future returns. If 
returns are identically distributed, their mean and variance do not change over time 
(a property known as stationarity that is important in time series modeling, a Level 
II topic). 


LOS 6.b: Describe Monte Carlo simulation and explain how it can be used in 
investment applications. 


Monte Carlo simulation is a technique based on the repeated generation of one or 
more risk factors that affect security values to generate a distribution of security 
values. For each of the risk factors, the analyst must specify the parameters of the 
probability distribution that the risk factor is assumed to follow. A computer is then 
used to generate random values for each risk factor based on its assumed 
probability distributions. Each set of randomly generated risk factors is used with a 
pricing model to value the security. This procedure is repeated many times (100s, 
1,000s, or 10,000s), and the distribution of simulated asset values is used to draw 
inferences about the expected (mean) value of the security—and possibly the 
variance of security values about the mean as well. 


As an example, consider the valuation of stock options that can only be exercised on 
a particular date. The main risk factor is the value of the stock itself, but interest 
rates could affect the valuation as well. The simulation procedure would be to do the 
following: 


1. Specify the probability distributions of stock prices and of the relevant interest 
rate, as well as the parameters (e.g., mean, variance, skewness) of the distributions. 


2. Randomly generate values for both stock prices and interest rates. 

3. Value the options for each pair of risk factor values. 

4. After many iterations, calculate the mean option value and use that as your 
estimate of the option’s value. 


Monte Carlo simulation is used to do the following: 


= Value complex securities. 
» Simulate the profits/losses from a trading strategy. 


« Calculate estimates of value at risk (VaR) to determine the riskiness of a portfolio 
of assets and liabilities. 


= Simulate pension fund assets and liabilities over time to examine the variability of 
the difference between the two. 


» Value portfolios of assets that have nonnormal return distributions. 


An advantage of Monte Carlo simulation is that its inputs are not limited to the 
range of historical data. This allows an analyst to test scenarios that have not 
occurred in the past. The limitations of Monte Carlo simulation are that it is fairly 
complex and will provide answers that are no better than the assumptions about the 
distributions of the risk factors and the pricing/valuation model that is used. Also, 
simulation is not an analytic method, but a statistical one, and cannot offer the 
insights provided by an analytic method. 


LOS 6.c: Describe the use of bootstrap resampling in conducting a simulation 
based on observed data in investment applications. 


Resampling is another method for generating data inputs to use in a simulation. 
Often, we do not (or cannot) have data for a population, and can only approximate 
the population by sampling from it. (For example, we may think of the observed 
historical returns on an investment as a sample from the population of possible 
return outcomes.) To conduct resampling, we start with the observed sample and 
repeatedly draw subsamples from it, each with the same number of observations. 
From these samples, we can infer parameters for the population, such as its mean 
and variance. 


In our reading on Estimation and Inference, we will describe some of the available 
resampling techniques. One of these is known as bootstrap resampling. In 
bootstrap resampling, we draw repeated samples of size n from the full dataset, 
replacing the sampled observations each time so that they might be redrawn in 
another sample. We can then directly calculate the standard deviation of these 
sample means as our estimate of the standard error of the sample mean. 


Simulation using data from bootstrap resampling follows the same procedure as 
Monte Carlo simulation. The difference is the source and scope of the data. For 
example, if a simulation uses bootstrap resampling of historical returns data, its 
inputs are limited by the distribution of actual outcomes. 


~ 


= MODULE QUIZ 6.1 


1. For a lognormal distribution the: 
A. mean equals the median. 
B. probability of a negative outcome is zero. 
C. probability of a positive outcome is 50%. 


2. Which of the following is least likely to be a limitation of Monte Carlo analysis? 
A. Monte Carlo simulation is a statistical rather than an analytic method. 
B. Results of the analysis are no better than the assumptions used to generate it. 
C. Monte Carlo simulation is unable to provide answers to “what if’ questions. 


3. Which of the following is most likely a strength of bootstrapping? 
A. Offers a representation of the statistical features of a population. 
B. Provides only statistical estimates, not exact results. 
C. Inputs may be limited by the distribution of actual outcomes. 


KEY CONCEPTS 


LOS 6.a 


If x is normally distributed, e* follows a lognormal distribution. The lognormal 
distribution is useful for modeling an asset’s future price as the result of a 
continuously compounded return on its current price. 


If investment returns are independently distributed, past returns are not useful for 
predicting future returns. 


If investment returns are identically distributed, their mean and variance do not 
change over time. 


LOS 6.b 


Monte Carlo simulation uses randomly generated values for risk factors, based on 
their assumed distributions, to produce a distribution of possible security values. Its 
limitations are that it is fairly complex and will provide answers that are no better 
than the assumptions used. 


LOS 6.c 

Bootstrap resampling involves drawing repeated samples from a sample that 
represents the population, replacing the sampled observations each time so that 
they might be redrawn in another sample. The standard deviation of these sample 
means is an estimate of the standard error of the sample mean. As with all 
simulation techniques, its answers are no better than the assumptions used. 


ANSWER KEY FOR MODULE QUIZZES 


Module Quiz 6.1 


1.B A lognormally distributed variable is never negative. (LOS 6.a) 


2.C The ability to address “what if” questions (i-e., using input data outside the 
range that has been observed historically) is an advantage of Monte Carlo 
simulation. (LOS 6.b) 


3.A One of the strengths of bootstrapping is that it offers a good representation of 
the statistical features of a population. However, this method does not provide 
exact results, and the inputs can be limited by the distribution of actual 
outcomes. (LOS 6.c) 


READING 7 
ESTIMATION AND INFERENCE 


MODULE 7.1: SAMPLING TECHNIQUES Go 
AND THE CENTRAL LIMIT THEOREM 


Video covering 
this content is 


LOS 7.a: Compare and contrast simple random, stratified available online. 
random, cluster, convenience, and judgmental sampling and 
their implications for sampling error in an investment problem. 


Probability sampling refers to selecting a sample when we know the probability of 
each sample member in the overall population. With random sampling, each item 
is assumed to have the same probability of being selected. If we have a population of 
data and select our sample by using a computer to randomly select a number of 
observations from the population, each data point has an equal probability of being 
selected—we call this simple random sampling. If we want to estimate the mean 
profitability for a population of firms, this may be an appropriate method. 


Nonprobability sampling is based on either low cost and easy access to some data 
items, or on using the judgment of the researcher in selecting specific data items. 
Less randomness in selection may lead to greater sampling error. 


Probability Sampling Methods 


Simple random sampling is a method of selecting a sample in such a way that each 
item or person in the population being studied has the same likelihood of being 
included in the sample. As an example of simple random sampling, assume that you 
want to draw a sample of 5 items out of a group of 50 items. This can be 
accomplished by numbering each of the 50 items, placing them in a hat, and shaking 
the hat. Next, one number can be drawn randomly from the hat. Repeating this 
process (experiment) four more times results in a set of five numbers. The five 
drawn numbers (items) comprise a simple random sample from the population. 


In applications like this one, a random-number table or a computer random-number 
generator is often used to create the sample. Another way to form an approximately 
random sample is systematic sampling—selecting every nth member from a 
population. 


Stratified random sampling uses a classification system to separate the population 
into smaller groups based on one or more distinguishing characteristics. From each 
subgroup, or stratum, a random sample is taken and the results are pooled. The size 


of the samples from each stratum is based on the size of the stratum relative to the 
population. 


Stratified sampling is often used in bond indexing because of the difficulty and cost 
of completely replicating the entire population of bonds. In this case, bonds ina 
population are categorized (stratified) according to major bond risk factors 
including, but not limited to, duration, maturity, and coupon rate. Then, samples are 
drawn from each separate category and combined to form a final sample. 


To see how this works, suppose you want to construct a portfolio of 100 bonds that 
is indexed to a major municipal bond index of 1,000 bonds, using a stratified random 
sampling approach. First, the entire population of 1,000 municipal bonds in the 
index can be classified on the basis of maturity and coupon rate. Then, cells 
(stratum) can be created for different maturity /coupon combinations, and random 
samples can be drawn from each of the maturity/coupon cells. To sample from a cell 
containing 50 bonds with 2-to-4-year maturities and coupon rates less than 5%, we 
would select five bonds. The number of bonds drawn from a given cell corresponds 
to the cell’s weight relative to the population (index), or (50/1,000) x 100 = 5 bonds. 
This process is repeated for all the maturity/coupon cells, and the individual 
samples are combined to form the portfolio. 


By using stratified sampling, we guarantee that we sample five bonds from this cell. 
If we had used simple random sampling, there would be no guarantee that we would 
sample any of the bonds in the cell. Or, we may have selected more than five bonds 
from this cell. 


Cluster sampling is also based on subsets of a population, but in this case, we are 
assuming that each subset (cluster) is representative of the overall population with 
respect to the item we are sampling. For example, we may have data on personal 
incomes for a state’s residents by county. The data for each county is a cluster. 


In one-stage cluster sampling, a random sample of clusters is selected, and all the 
data in those clusters comprise the sample. In two-stage cluster sampling, random 
samples from each of the selected clusters comprise the sample. Contrast this with 
stratified random sampling, in which random samples are selected from every 
subgroup. 


To the extent that the subgroups do not have the same distribution as the entire 
population of the characteristic of interest, cluster sampling will have greater 
sampling error than simple random sampling. Two-stage cluster sampling can be 
expected to have greater sampling error than one-stage cluster sampling. Lower cost 
and less time required to assemble the sample are the primary advantages of cluster 
sampling, and it may be most appropriate for a smaller pilot study. 


Nonprobability Sampling Methods 


Convenience sampling refers to selecting sample data based on ease of access, 
using data that are readily available. Because such a sample is typically not random, 
sampling error will be greater. An analyst should initially look at the data before 
adopting a sampling method with less sampling error. 


Judgmental sampling refers to samples for which each observation is selected 
from a larger dataset by the researcher, based on one’s experience and judgment. As 


an example, a researcher interested in assessing company compliance with 
accounting standards may have experience suggesting that evidence of 
noncompliance is typically found in certain ratios derived from the financial 
statements. The researcher may select only data on these items. Researcher bias (or 
simply poor judgment) may lead to samples that have excessive sampling error. In 
the absence of bias or poor judgment, judgmental sampling may produce a more 
representative sample or allow the researcher to focus on a sample that offers good 
data on the characteristic or statistic of interest. 


An important consideration when sampling is ensuring that the distribution of data 
of interest is constant for the whole population being sampled. For example, judging 
a characteristic of U.S. banks using data from 2005 to 2015 may not be appropriate. 
Regulatory reform of the banking industry after the financial crisis of 2007-2008 
may have resulted in significant changes in banking practices, so that the mean of a 
statistic precrisis and its mean value across the population of banks postcrisis are 
quite different. Pooling the data over the entire period from 2005 to 2015 would not 
be appropriate if this is the case, and the sample mean calculated from these data 
would not be a good estimate of either precrisis or postcrisis mean values. 


LOS 7.b: Explain the central limit theorem and its importance for the 
distribution and standard error of the sample mean. 


The central limit theorem states that for simple random samples of size n from a 
population with a mean p and a finite variance o, the sampling distribution of the 
sample mean | approaches a normal probability distribution with mean up and a 
variance equal to s as the sample size becomes large. 


The central limit theorem is extremely useful because the normal distribution is 
relatively easy to apply to hypothesis testing and to the construction of confidence 
intervals. Specific inferences about the population mean can be made from the 
sample mean, regardless of the population’s distribution, as long as the sample size is 
sufficiently large, which usually means n 2 30. 


Important properties of the central limit theorem include the following: 


« Ifthe sample size n is sufficiently large (n = 30), the sampling distribution of the 
sample means will be approximately normal. Remember what’s going on here: 
random samples of size n are repeatedly being taken from an overall larger 
population. Each of these random samples has its own mean, which is itself a 
random variable, and this set of sample means has a distribution that is 
approximately normal. 

= The mean of the population, y, and the mean of the distribution of all possible 
sample means are equal. 


= The variance of the distribution of sample means is <. the population variance 


divided by the sample size. 


The standard error of the sample mean is the standard deviation of the 
distribution of the sample means. 


When the standard deviation of the population, o, is known, the standard error of the 
sample mean is calculated as: 


a o 
2 Va 


where: 


o,, = standard error of the sample mean 


o = standard deviation of the population 
n = size of the sample 


However, practically speaking, the population’s standard deviation is almost never 
known. Instead, the standard error of the sample mean must be estimated by 
dividing the standard deviation of the sample by va: 


s 


Ss Va 


EXAMPLE: Standard error of sample mean (unknown population variance) 


Suppose a sample contains the past 30 monthly returns for McCreary, Inc. The 
mean return is 2%, and the sample standard deviation is 20%. Calculate and 
interpret the standard error of the sample mean. 


Answer: 
Because o is unknown, this is the standard error of the sample mean: 


S_ = Wh _ 3.6%, 


z Vn ¥30 
This implies that if we took all possible samples of the size of 30 from McCreary’s 


monthly returns and prepared a sampling distribution of the sample means, the 
mean would be 2% with a standard error of 3.6%. 


EXAMPLE: Standard error of sample mean (unknown population variance) 


Continuing with our example, suppose that instead of a sample size of 30, we take 
a sample of the past 200 monthly returns for McCreary, Inc. To highlight the effect 
of sample size on the sample standard error, let’s assume that the mean return 
and standard deviation of this larger sample remain at 2% and 20%, respectively. 
Now, calculate the standard error of the sample mean for the 200-return sample. 


Answer: 


The standard error of the sample mean is computed as follows: 


2 2 20% 0, 
Ss — Vi — 200 — 1.4% 


The result of the preceding two examples illustrates an important property of 
sampling distributions. Notice that the value of the standard error of the sample 
mean decreased from 3.6% to 1.4% as the sample size increased from 30 to 200. This 
is because as the sample size increases, the sample mean gets closer, on average, to 
the true mean of the population. In other words, the distribution of the sample 
means about the population mean gets smaller and smaller, so the standard error of 
the sample mean decreases. 


LOS 7.c: Describe the use of resampling (bootstrap, jackknife) to estimate the 
sampling distribution of a statistic. 


Previously, we used the sample variance to calculate the standard error of our 
estimate of the mean. The standard error provides better estimates of the 
distribution of sample means when the sample is unbiased and the distribution of 
sample means is approximately normal. 


Two alternative methods of estimating the standard error of the sample mean 
involve resampling of the data. The first of these, termed the jackknife, calculates 
multiple sample means, each with one of the observations removed from the sample. 
The standard deviation of these sample means can then be used as an estimate of the 
standard error of sample means. The jackknife is a computationally simple tool and 
can be used when the number of observations available is relatively small. This 
method can remove bias from statistical estimates. 


The jackknife (so named because it is a handy and readily available tool) was 
developed when computational power was not as readily available and as low cost 
as today. A bootstrap method is more computationally demanding, but it has some 
advantages. To estimate the standard error of the sample mean, we draw repeated 
samples of size n from the full dataset (replacing the sampled observations each 
time). We can then directly calculate the standard deviation of these sample means 
as our estimate of the standard error of the sample mean. 


The bootstrap method can improve accuracy compared to using only the data ina 
single sample, and it can be used to construct confidence intervals for various 
statistics in addition to the mean, such as the median. This method can also be used 
to estimate the distributions of complex statistics, including those that do not have 
an analytic form. 


—» MODULE QUIZ 7.1 
1. A simple random sample is a sample drawn in such a way that each member of 
the population has: 
A. some chance of being selected in the sample. 
B. an equal chance of being included in the sample. 


C. a 1% chance of being included in the sample. 


2. To apply the central limit theorem to the sampling distribution of the sample 
mean, the sample is usually considered to be large if n is at least: 
A. 20. 
B. 25. 
C. 30. 


3. Which of the following techniques to improve the accuracy of confidence intervals 
on a statistic is most computationally demanding? 
A. Jackknife resampling. 
B. Systematic resampling. 
C. Bootstrap resampling. 


KEY CONCEPTS 


LOS 7.a 


Simple random sampling is a method of selecting a sample in such a way that each 
item or person in the population being studied has the same probability of being 
included in the sample. 


Stratified random sampling involves randomly selecting samples proportionally 
from subgroups that are formed based on one or more distinguishing characteristics 
of the data, so that random samples from the subgroups will have the same 
distribution of these characteristics as the overall population. 


Cluster sampling is also based on subgroups (not necessarily based on data 
characteristics) of a larger dataset. In one-stage cluster sampling, the sample is 
formed from randomly chosen clusters (subsets) of the overall dataset. In two-stage 
cluster sampling, random samples are taken from each of the randomly chosen 
clusters (subgroups). 


Convenience sampling refers to selecting sample data based on ease of access, using 
data that are readily available. Judgmental sampling refers to samples for which each 
observation is selected from a larger dataset by the researcher, based on the 
researcher’s experience and judgment. Both are examples of nonprobability 
sampling and are nonrandom. 


LOS 7.b 


The central limit theorem states that for a population with a mean pt and a finite 
variance o%, the sampling distribution of the sample mean of all possible samples of 
size n (for n = 30) will be approximately normally distributed with a mean equal to 
m anda variance equal to o7/n. 


The standard error of the sample mean is the standard deviation of the distribution 


of the sample means and is calculated as ox = = (where o, the population standard 
deviation, is known) and as sx = a (where s, the sample standard deviation, is used 


because the population standard deviation is unknown). 


LOS 7.c 


Two resampling techniques to improve our estimates of the distribution of sample 
statistics are the jackknife and bootstrap. With the jackknife, we calculate n sample 
means, one with each observation in a sample of size n removed, and base our 
estimate on the standard error of sample means of size n. This can remove bias from 
our estimates based on the sample standard deviation without resampling. 


With bootstrap resampling, we use the distribution of sample means (or other 
statistics) from a large number of samples of size n, drawn from a large dataset. 
Bootstrap resampling can improve our estimates of the distribution of various 
sample statistics and provide such estimates when analytical methods will not. 


ANSWER KEY FOR MODULE QUIZZES 


Module Quiz 7.1 


1.B Ina simple random sample, each element of the population has an equal 
probability of being selected. The 1% chance answer option allows for an equal 


chance, but only if there are 100 elements in the population from which the 
random sample is drawn. (LOS 7.a) 


2.C Sample sizes of 30 or greater are typically considered large. (LOS 7.b) 


3.C Bootstrap resampling, repeatedly drawing samples of equal size from a large 
dataset, is more computationally demanding than the jackknife. We have not 
defined systematic resampling as a specific technique. (LOS 7.c) 


READING 8 
HYPOTHESIS TESTING 


MODULE 8.1: HYPOTHESIS TESTING Gg 
BASICS 


Video covering 
this content is 


LOS 8.a: Explain hypothesis testing and its components, available online. 
including statistical significance, Type I and Type II errors, and 
the power of a test. 


A hypothesis is a statement about the value of a population parameter developed for 
the purpose of testing a theory or belief. Hypotheses are stated in terms of the 
population parameter to be tested, like the population mean, m. For example, a 
researcher may be interested in the mean daily return on stock options. Hence, the 
hypothesis may be that the mean daily return on a portfolio of stock options is 
positive. 


Hypothesis testing procedures, based on sample statistics and probability theory, 
are used to determine whether a hypothesis is a reasonable statement and should 
not be rejected, or if it is an unreasonable statement and should be rejected. The 
process of hypothesis testing consists of a series of steps shown in Figure 8.1. 


Figure 8.1: Hypothesis Testing Procedure 


State the hypothesis 
Select the appropriate test statistic 
Specify the level of significance 
State the decision rule regarding the hypothesis 
Collect the sample and calculate the sample statistics 
Make a decision regarding the hypothesis 

Make a decision based on the results of the test 
Source: Wayne W. Daniel and James C. Terrell, Business Statistics, Basic Concepts and Methodology, Houghton 
Mifflin, Boston, 1997 


The null hypothesis, designated Hp, is the hypothesis that the researcher wants to 
reject. It is the hypothesis that is actually tested and is the basis for the selection of 


the test statistics. The null is generally stated as a simple statement about a 
population parameter. A typical statement of the null hypothesis for the population 
mean is Hp: f= Up, where m is the population mean and mg is the hypothesized value 
of the population mean. 


PROFESSOR’S NOTE 
The null hypothesis always includes the “equal to” condition. 


The alternative hypothesis, designated H., is what is concluded if there is sufficient 
evidence to reject the null hypothesis and is usually what you are really trying to 
assess. Why? You can never really prove anything with statistics—when the null 
hypothesis is discredited, the implication is that the alternative hypothesis is valid. 


For a null hypothesis Ho: = Ug, the alternative hypothesis is Hp:  # Ug. Notice that 
the null and alternative hypotheses are mutually exclusive and exhaustive. They 
include all possible outcomes of the test (they are exhaustive), and no possible 
outcome of the test satisfies both hypotheses (they are mutually exclusive). 


Let’s look at the development of the decision rule for a two-tailed test using a z- 
distributed test statistic (a z-test) at a 5% level of significance, a = 0.05: 


=» At a=0.05, the computed test statistic is compared with the critical z-values of 
+1.96. The values of +1.96 correspond to +Za/2 = +Z9 925, which is the range of z- 
values within which 95% of the probability lies. These values are obtained from 
the cumulative probability table for the standard normal distribution (z-table), 
which is included in the Appendix section of this book. 

« Ifthe computed test statistic falls outside the range of critical z-values (i.e., test 
statistic > 1.96, or test statistic < -1.96), we reject the null and conclude that the 
sample statistic is sufficiently different from the hypothesized value. 

« Ifthe computed test statistic falls within the range +1.96, we conclude that the 
sample statistic is not sufficiently different from the hypothesized value (u = Uo, in 
this case), and we fail to reject the null hypothesis. 


The decision rule (rejection rule) for a two-tailed z-test at a = 0.05 can be stated as 
follows: 


Reject Hp if test statistic < —1.96 or test statistic > 1.96 


Figure 8.2 shows the standard normal distribution for a two-tailed hypothesis test 
using the z-distribution. Notice that the significance level of 0.05 means that there is 
0.05 / 2 = 0.025 probability (area) under each tail of the distribution beyond +1.96. 


Figure 8.2: Two-Tailed Hypothesis Test Using the Standard Normal (z) 
Distribution 


-1.96 1.96 


Reject H, Fail to Reject H, Reject H, 


Hypothesis testing involves two statistics: (1) the test statistic calculated from the 
sample data and (2) the critical value of the test statistic. The value of the 
computed test statistic relative to the critical value is a key step in assessing the 
validity of a hypothesis. 


A test statistic is calculated by comparing the point estimate of the population 
parameter with the hypothesized value of the parameter (i.e., the value specified in 
the null hypothesis). As indicated in the following expression, the test statistic is the 
difference between the sample statistic and the hypothesized value, scaled by the 
standard error of the sample statistic: 


sample statistic — hypothesized value 


test statistic = dard error of the sample statistic 


The standard error of the sample statistic is the adjusted standard deviation of the 
sample. When the sample statistic is the sample mean, x, the standard error of the 
sample statistic for sample size n, is calculated as: 


o 
o- 


| a Van 
when the population standard deviation, o, is known, or: 


Ss 
Ss Vi 
when the population standard deviation, o, is not known. In this case, it is estimated 


using the standard deviation of the sample, s. 


PROFESSOR’S NOTE 

Don't be confused by the notation here. A lot of the literature you will 

encounter in your studies simply uses the term ©, for the standard error 

of the test statistic, regardless of whether the population standard 

deviation or sample standard deviation was used in its computation. 
As you will soon see, a test statistic is a random variable that may follow one of 
several distributions, depending on the characteristics of the sample and the 
population. We will look at four distributions for test statistics: the t-distribution, 
the z-distribution (standard normal distribution), the chi-square distribution, and 
the F-distribution. The critical value for the appropriate test statistic—the value 
against which the computed test statistic is compared—depends on its distribution. 


Type I and Type II Errors 


Keep in mind that hypothesis testing is used to make inferences about the 
parameters of a given population on the basis of statistics computed for a sample 


that is drawn from that population. We must be aware that there is some probability 
that the sample, in some way, does not represent the population, and any conclusion 
based on the sample about the population may be made in error. 


When drawing inferences from a hypothesis test, there are two types of errors: 


= Type I error. This is the rejection of the null hypothesis when it is actually true. 


= Type Il error. This is the failure to reject the null hypothesis when it is actually 
false. 


The significance level is the probability of making a Type I error (rejecting the null 
when it is true) and is designated by the Greek letter alpha (a). For instance, a 
significance level of 5% (a = 0.05) means there is a 5% chance of rejecting a true 
null hypothesis. When conducting hypothesis tests, a significance level must be 
specified to identify the critical values needed to evaluate the test statistic. 


While the significance level of a test is the probability of rejecting the null 
hypothesis when it is true, the power of a test is the probability of correctly 
rejecting the null hypothesis when it is false. The power of a test is actually one 
minus the probability of making a Type II error, or 1 - P(Type II error). In other 
words, the probability of rejecting the null when it is false (power of the test) equals 
one minus the probability of not rejecting the null when it is false (Type II error). 
When more than one test statistic may be used, the power of the test for the 
competing test statistics may be useful in deciding which test statistic to use. 
Ordinarily, we wish to use the test statistic that provides the most powerful test 
among all possible tests. 


Figure 8.3 shows the relationship between the level of significance, the power of a 
test, and the two types of errors. 


Figure 8.3: Type I and Type II Errors in Hypothesis Testing 


Decision Tme Condition 


Do not reject H, Correct decision Incorrect decision 
Type I error 


Reject H, Incorrect decision Correct decision 


Type I error Power of the test 


Significance level, «, 1 — P(Type 
error) 


P(Type I error) 


Sample size and the choice of significance level (Type I error probability) will 
together determine the probability of a Type IJ error. The relation is not simple, 
however, and calculating the probability of a Type II error in practice is quite 
difficult. Decreasing the significance level (probability of a Type I error) from 5% to 
1%, for example, will increase the probability of failing to reject a false null (Type IJ 
error)—and, therefore, reduce the power of the test. Conversely, for a given sample 
size, we can increase the power of a test only with the cost that the probability of 
rejecting a true null (Type I error) increases. For a given significance level, we can 
decrease the probability of a Type II error and increase the power of a test only by 
increasing the sample size. 


The decision for a hypothesis test is to either reject the null hypothesis or fail to 
reject the null hypothesis. Note that it is statistically incorrect to say “accept” the 
null hypothesis; it can only be supported or rejected. The decision rule for rejecting 
or failing to reject the null hypothesis is based on the distribution of the test 
statistic. For example, if the test statistic follows a normal distribution, the decision 
rule is based on critical values determined from the standard normal distribution (z- 
distribution). Regardless of the appropriate distribution, it must be determined if a 
one-tailed or two-tailed hypothesis test is appropriate before a decision rule 
(rejection rule) can be determined. 


A decision rule is specific and quantitative. Once we have determined whether a 
one- or two-tailed test is appropriate, the significance level we require, and the 
distribution of the test statistic, we can calculate the exact critical value for the test 
statistic. Then, we have a decision rule of the following form: if the test statistic is 
(greater, less than) the value X, reject the null. 


The p-value is the probability of obtaining a test statistic that would lead to a 
rejection of the null hypothesis, assuming the null hypothesis is true. It is the 
smallest level of significance for which the null hypothesis can be rejected. 


=I MODULE QUIZ 8.1 
=k 
1. For a hypothesis test with a probability of a Type II error of 60% and a probability 
of a Type I error of 5%, which of the following statements is most accurate? 
A. The power of the test is 40%, and there is a 5% probability that the test 
statistic will exceed the critical value(s). 
B. There is a 95% probability that the test statistic will be between the critical 
values, if this is a two-tailed test. 
C. There is a 5% probability that the null hypothesis will be rejected when 
actually true, and the probability of rejecting the null when it is false is 40%. 
2. If the significance level of a test is 0.05 and the probability of a Type II error is 
0.15, what is the power of the test? 
A. 0.850. 
B. 0.950. 
C. 0.975. 


MODULE 8.2: TYPES OF HYPOTHESIS C) 
TESTS 


Video covering 
this content is 
available online. 


LOS 8.b: Construct hypothesis tests and determine their 
statistical significance, the associated Type I and Type II errors, 
and power of the test given a significance level. 


Here, we will show examples of hypothesis tests for population means and 
population variances: 


= For a hypothesis concerning the value of a population mean, we use a t-test (or a z- 
test if the sample size is large enough). 


= To test a hypothesis concerning the equality of two population means, we use a t- 
test. The nature of that test depends on whether the samples are independent (a 


difference in means test) or dependent (a paired comparisons test). 


= For a hypothesis concerning the value of a population variance, we use a chi- 
square test. 


= To test a hypothesis concerning the equality of two population variances, we use 
an F-test. 


Value of a Population Mean 


EXAMPLE: Hypothesis test of a population mean 


A researcher has gathered data on the daily returns on a portfolio of call options 
over a recent 250-day period. The mean daily return has been 0.1%, and the 
sample standard deviation of daily portfolio returns is 0.25%. The researcher 
believes that the mean daily portfolio return is not equal to zero. Construct a 
hypothesis test of the researcher’s belief. 


Answer: 


First, we need to specify the null and alternative hypotheses: 
Ho: bp = 0 versus H,: 1p = 0 


The t-distribution is used to test a hypothesis about the value of a population 
mean. With 250 observations, however, this sample is considered to be large, so 
the z-distribution is acceptable. Because our sample is so large, the critical values 
for the t and z are almost identical. Hence, there is almost no difference in the 
likelihood of rejecting a true null. 


At a5% level of significance, the critical z-values for a two-tailed test are +1.96, 
so the decision rule can be stated as: 

Reject Hp if test statistic < —1.96 or test statistic > +1.96 
Given a sample size of 250 with a standard deviation of 0.25%, the standard error 
can be computed as: 


s, = —S_ = 9.25% _ 0,0158% 
vad ¥250 


Using the standard error of the sample mean, our test statistic is: 


0.001 0.001 
(0.0025) = 9000158 ~ °39 
“V250 


Because 6.33 > 1.96, we can reject the null hypothesis that the mean daily option 
return is equal to zero. 


Difference Between Means (Independent Samples) 


We frequently want to know if there is a difference between the means of two 
populations. The t-test for differences between means requires that we are 
reasonably certain that our samples are independent and that they are taken from 
two populations that are normally distributed. 


PROFESSOR’S NOTE 
Please note the language of the LOS here. Candidates must “[c]onstruct 
hypothesis tests and determine their statistical significance ...” Certainly, 


you should know that this is a t-test, and that we reject the hypothesis of 
equality when the test statistic is outside the critical t-values. Don't 
worry about memorizing the following formulas. 

A pooled variance is used with the t-test for testing the hypothesis that the means of 


two normally distributed populations are equal, when the variances of the 
populations are unknown but assumed to be equal. 


Assuming independent samples, the t-statistic is computed as follows: 


(X; — Xz) — (44 — H) 


> 


P P 
Va, +a, 


t 


> (ay — 1)sf + (a2 — 1)83 

Ss? variance of the first sample 

Sy = variance of the second sample 

n, = number of observations in the first sample 

n, = number of observations in the second sample 
Note: degrees of freedom, df, is (n; + n> — 2) 


Because we assume that the variances are equal, we just add the variances of the 
two sample means to calculate the standard error in the denominator. 


The intuition here is straightforward. If the sample means are close together, the 
numerator of the t-statistic (and the t-statistic itself) are small, and we do not reject 
equality. If the sample means are far apart, the numerator of the t-statistic (and the 
t-statistic itself) are large, and we reject equality. Perhaps not as easy to remember 
is the fact that this test is only valid for two populations that are independent and 
normally distributed. 


EXAMPLE: Difference between means—equal variances 


Sue Smith is investigating whether the abnormal returns for acquiring firms 
during merger announcement periods differ for horizontal and vertical mergers. 
She estimates the abnormal returns for a sample of acquiring firms associated 
with horizontal mergers and a sample of acquiring firms involved in vertical 
mergers. Smith finds that abnormal returns from horizontal mergers have a mean 
of 1.0% and a standard deviation of 1.0%, while abnormal returns from vertical 
mergers have a mean of 2.5% and a standard deviation of 2.0%. 


Smith assumes the samples are independent, the population means are normally 
distributed, and the population variances are equal. 


Smith calculates the t-statistic as -5.474 and the degrees of freedom as 120. Using 
a 5% significance level, should Smith reject or fail to reject the null hypothesis 
that the abnormal returns to acquiring firms during the announcement period are 
the same for horizontal and vertical mergers? 


Answer: 


Because this is a two-tailed test, the structure of the hypotheses takes the 
following form: 


Ho: 4y — Hy = O versus H,: py — pp = 0 
where: 
Ut, = the mean of the abnormal returns for the horizontal mergers 


[lp = the mean of the abnormal returns for the vertical mergers 


From the following t-table segment, the critical t-value for a 5% level of 
significance at « / 2 = p = 0.025 with df = 120, is 1.980. 


Partial f-Table 


One-Tailed Probabilities (p) 
[p= 0.10 | p= 0.05 | p= 0.025 | 


120 1.980 
1.972 
Thus, the decision rule can be stated as follows: 
Reject Hg if t-statistic < -1.980 or t-statistic > 1.980 


The rejection region for this test is illustrated in the following figure. 
Decision Rule for Two-Tailed t-Test 
(a = 0.05, df = 120) 


2.5% 2.5% 


-1.980 1.980 
Reject H, Fail to Reject H, Reject H, 


Because the test statistic, —-5.474, falls to the left of the lower critical t-value, 
Smith can reject the null hypothesis and conclude that mean abnormal returns 
are different for horizontal and vertical mergers. 


Paired Comparisons (Means of Dependent Samples) 


While the test in the previous section was of the difference between the means of 
two independent samples, sometimes, our samples may be dependent. If the 
observations in the two samples both depend on some other factor, we can construct 
a paired comparisons test of whether the means of the differences between 
observations for the two samples are different. Dependence may result from an 
event that affects both sets of observations for a number of companies, or because 
observations for two firms over time are both influenced by market returns or 
economic conditions. 


For an example of a paired comparisons test, consider a test of whether the returns 
on two steel firms were equal over a five-year period. We can’t use the difference in 
means test because we have reason to believe that the samples are not independent. 
To some extent, both will depend on the returns on the overall market (market risk) 
and the conditions in the steel industry (industry-specific risk). In this case, our 
pairs will be the returns on each firm over the same time periods, so we use the 
differences in monthly returns for the two companies. The paired comparisons test 
is just a test of whether the average difference between monthly returns is 
significantly different from some value, typically zero, based on the standard error 
of the differences in monthly returns. 


Remember, the paired comparisons test also requires that the sample data be 
normally distributed. Although we frequently just want to test the hypothesis that 
the mean of the differences in the pairs is zero (1g, = 0), the general form of the test 
for any hypothesized mean difference, mg,, is as follows: 

Hp: pig = Hg, Versus H,: pg = pg, 

where: 


[lq = mean of the population of paired differences 
lg, = hypothesized mean of paired differences, which is commonly zero 


For one-tail tests, the hypotheses are structured as either of these: 

Ho: bg S Mg, Versus H,: 3 > pg,, or Hp: py > Wy, versus A: py < py, 
For the paired comparisons test, the t-statistic with n — 1 degrees of freedom is 
computed as follows: 


where: 
1 a 
d = sample mean difference >> d: 
i=1 
d. = difference between the ‘th pair of observations 


s 
Sq = standard error of the mean difference $ 


IS\(a;- a) 


- | 4 
S$; = sample standard deviation \ — 


n = number of paired observations 


EXAMPLE: Paired comparisons test 


Joe Andrews is examining changes in estimated betas for the common stock of 
companies in the telecommunications industry before and after deregulation. 
Andrews believes the betas may decline because of deregulation, because 
companies are no longer subject to the uncertainties of rate regulation—or that 
they may increase because there is more uncertainty regarding competition in 
the industry. Andrews calculates a t-statistic of 10.26 for this hypothesis test, 
based on a sample size of 39. Using a 5% significance level, determine whether 
there is a change in betas. 


Answer: 
Because the mean difference may be positive or negative, a two-tailed test is in 
order here. Thus, the hypotheses are structured as: 

Ho: pg = 0 versus H,: 5 = 0 
There are 39 — 1 = 38 degrees of freedom. Using the t-distribution, the two-tailed 
critical t-values for a 5% level of significance with df = 38 is +2.024. As indicated 
in the following table, the critical t-value of 2.024 is located at the intersection of 


the p = 0.025 column and the df = 38 row. The one-tailed probability of 0.025 is 
used because we need 2.5% in each tail for 5% significance with a two-tailed test. 


Partial t-Table 
df| One-Tailed Probabilities (p) 


p= 0.10] p=0.05 [p= 0.025 | 


Thus, this is the decision rule: 
Reject Hg if t-statistic < -2.024, or t-statistic > 2.024 


This decision rule is illustrated in the following figure. 


Decision Rule for a Two-Tailed Paired Comparisons Test 
(a= 0.05, df = 38) 


2.5% 2.5% 


- 2.024 2.024 


Reject H, Fail to Reject H, Reject H, 


The test statistic, 10.26, is greater than the critical t-value, 2.024—it falls in the 
rejection region to the right of 2.024 in the previous figure. Thus, we reject the 
null hypothesis of no difference, concluding that there is a statistically significant 
difference between mean firm betas before and after deregulation. 


Keep in mind that we have been describing two distinct hypothesis tests: one about 
the significance of the difference between the means of two populations and one 
about the significance of the mean of the differences between pairs of observations. 
Here are rules for when these tests may be applied: 


= The test of the differences in means is used when there are two independent 
samples. 


= A test of the significance of the mean of the differences between paired 
observations is used when the samples are not independent. 


PROFESSOR’S NOTE 

Again, the LOS here says “[c]onstruct hypothesis tests and determine 
their statistical significance ...’ We can’t believe candidates are expected 
to memorize these formulas (or that you would be a better analyst if you 
did). You should instead focus on the fact that both of these tests involve 
t-statistics and depend on the degrees of freedom. Also note that when 
samples are independent, you can use the difference in means test, and 
when they are dependent, we must use the paired comparison (mean 
differences) test. In that case, with a null hypothesis that there is no 
difference in means, the test statistic is simply the mean of the 
differences between each pair of observations, divided by the standard 
error of those differences. This is just a straightforward t-test of whether 
the mean of a sample is zero, which might be considered fair game for the 
exam. 


Value of a Population Variance 


The chi-square test is used for hypothesis tests concerning the variance of a 
normally distributed population. Letting o” represent the true population variance 
and 9“ represent the hypothesized variance, the hypotheses for a two-tailed test of 
a single population variance are structured as follows: 


. 2 He 2 
Hp: o og versus H,: o* = 06 


The hypotheses for one-tailed tests are structured as follows: 
H,: o< oe versus H:: o> a or H,: o> a versus H,: o< a 
Hypothesis testing of the population variance requires the use of a chi-square 


distributed test statistic, denoted y2. The chi-square distribution is asymmetrical 
and approaches the normal distribution in shape as the degrees of freedom increase. 


To illustrate the chi-square distribution, consider a two-tailed test with a 5% level 
of significance and 30 degrees of freedom. As displayed in Figure 8.4, the critical chi- 
square values are 16.791 and 46.979 for the lower and upper bounds, respectively. 
These values are obtained from a chi-square table, which is used in the same manner 
as a t-table. A portion of a chi-square table is presented in Figure 8.5. 


Note that the chi-square values in Figure 8.5 correspond to the probabilities in the 
right tail of the distribution. As such, the 16.791 in Figure 8.4 is from the column 
headed 0.975 because 95% + 2.5% of the probability is to the right of it. The 46.979 
is from the column headed 0.025 because only 2.5% probability is to the right of it. 
Similarly, at a 5% level of significance with 10 degrees of freedom, Figure 8.5 shows 
that the critical chi-square values for a two-tailed test are 3.247 and 20.483. 


Figure 8.4: Decision Rule for a Two-Tailed Chi-Square Test 


(cx = 0.05, df = 30) 


2.5% 
0 
16.791 46.979 
Reject HH, | Fail to RejectH, Reject H, 


Figure 8.5: Chi-Square Table 


Probability in Right Tail 


| 0.975] 0.95 | 0.90 | 0.1 | 0.05 | 0.025] 
| 2.700| 3.325 | 4.168 | 14.684 | 16.919 | 19.023 | 


The chi-square test statistic, xX’, with n — 1 degrees of freedom, is computed as 
follows: 


2 (n — 1)s* 
Xa- oz 
where: 


n= sample size 
s* = sample variance 
O° = hypothesized value for the population variance 


Similar to other hypothesis tests, the chi-square test compares the test statistic, x2_; 
, to acritical chi-square value at a given level of significance and n — 1 degrees of 
freedom. Because the chi-square distribution is bounded below by zero, chi-square 
values cannot be negative. 


EXAMPLE: Chi-square test for a single population variance 


Historically, the High-Return Equity Fund has advertised that its monthly returns 
have a standard deviation equal to 4%. This was based on estimates from the 
2005-2013 period. High-Return wants to verify whether this claim still 
adequately describes the standard deviation of the fund’s returns. High-Return 
collected monthly returns for the 24-month period between 2013 and 2015 and 
measured a standard deviation of monthly returns of 3.8%. High-Return 
calculates a test statistic of 20.76. Using a 5% significance level, determine if the 
more recent standard deviation is different from the advertised standard 
deviation. 


Answer: 


The null hypothesis is that the standard deviation is equal to 4% and, therefore, 
the variance of monthly returns for the population is (0.04)* = 0.0016. Because 
High-Return simply wants to test whether the standard deviation has changed, up 
or down, a two-sided test should be used. The hypothesis test structure takes this 
form: 


H,: of = 0.0016 versus H.: a” ~ 0.0016 
The appropriate test statistic for tests of variance is a chi-square statistic. 


With a 24-month sample, there are 23 degrees of freedom. Using the table of chi- 
square values in Appendix E of this book, for 23 degrees of freedom and 
probabilities of 0.975 and 0.025, we find two critical values, 11.689 and 38.076. 
Thus, this is the decision rule: 

Reject Ho if x? < 11.689, or x? > 38.076 


This decision rule is illustrated in the following figure. 


Decision Rule for a Two-Tailed Chi-Square Test of a Single Population 
Variance 


(= 0.05, df= 23) 


11.689 38.076 


Reject H, Fail to Reject H, Reject H, 


Because the computed test statistic, Me falls between the two critical values, we 
cannot reject the null hypothesis that the variance is equal to 0.0016. The 
recently measured standard deviation is close enough to the advertised standard 
deviation that we cannot say that it is different from 4%, at a 5% level of 
significance. 


Comparing Two Population Variances 


The hypotheses concerned with the equality of the variances of two populations are 
tested with an F-distributed test statistic. Hypothesis testing using a test statistic 
that follows an F-distribution is referred to as the F-test. The F-test is used under the 
assumption that the populations from which samples are drawn are normally 
distributed, and that the samples are independent. 


If we let 0,2 and 0,” represent the variances of normal Population 1 and Population 


2, respectively, the hypotheses for the two-tailed F-test of differences in the 
variances can be structured as follows: 


> ? > 7 
Hp: OT = 05 versus H:: O7 #95 


The one-sided test structures can be specified as follows: 


? ? 


- «2 ; es er eee eee 
Hy: oy < 03 Versus H,: Of > 95, Or Ho: oO; = 03 versus H,: of < 95 


The test statistic for the F-test is the ratio of the sample variances. The F-statistic is 
computed as follows: 


s? 
F= 

$3 
where: 


a = variance of the sample of n, observations drawn from Population 1 


S Z = variance of the sample of n, observations drawn from Population 2 


Note that n, - 1 and n, - 1 are the degrees of freedom used to identify the 
appropriate critical value from the F-table (provided in this book’s Appendix). 


PROFESSOR’S NOTE 
Always put the larger variance in the numerator (s,7). Following this 
convention means that we only have to consider the critical value for the 
right-hand tail. 
An F-distribution is presented in Figure 8.6. As indicated, the F-distribution is right 
skewed and is bounded by zero on the left-hand side. The shape of the F-distribution 
is determined by two separate degrees of freedom: the numerator degrees of 
freedom, df,, and the denominator degrees of freedom, df). 


Figure 8.6: F-Distribution 


numerator df, = 10, denominator df, =10 


0.27 3.72 


Fail to RejectH, Reject H, 


Note that when the sample variances are equal, the value of the test statistic is 1. 
The upper critical value is always greater than one (the numerator is significantly 
greater than the denominator), and the lower critical value is always less than one 
(the numerator is significantly smaller than the denominator). In fact, the lower 
critical value is the reciprocal of the upper critical value. For this reason, in practice, 
we put the larger sample variance in the numerator and consider only the upper 
critical value. 


EXAMPLE: F-test for equal variances 


Annie Cower is examining the earnings for two different industries. Cower 
suspects that the variance of earnings in the textile industry is different from the 
variance of earnings in the paper industry. To confirm this suspicion, Cower has 
looked at a sample of 31 textile manufacturers and a sample of 41 paper 
companies. She measured the sample standard deviation of earnings across the 
textile industry to be $4.30, and that of the paper industry companies to be $3.80. 
Cower calculates a test statistic of 1.2805. Using a 5% significance level, 
determine if the earnings of the textile industry have a different standard 
deviation than those of the paper industry. 


Answer: 


In this example, we are concerned with whether the variance of earnings for 
companies in the textile industry is equal to the variance of earnings for 
companies in the paper industry. As such, the test hypotheses can be 
appropriately structured as follows: 


) 


? ? 7? 
Hp: Oo; = 7 Versus H:: Oo; #95 


For tests of difference between variances, the appropriate test statistic is: 


where s; is the larger sample variance. 


Using the sample sizes for the two industries, the critical F-value for our test is 
found to be 1.94. This value is obtained from the table of the F-distribution for 
2.5% in the upper tail, with df, = 30 and df, = 40. Thus, if the computed F-statistic 
is greater than the critical value of 1.94, the null hypothesis is rejected. The 
decision rule, illustrated in the following figure, can be stated as follows: 

Reject Ho if F > 1.94 
Decision Rule for F-Test 


(cx = 0.05, df, = 30, df, = 40) 


0 1.94 


Fail to Reject H, | Reject H, 


Because the calculated F-statistic of 1.2805 is less than the critical F-statistic of 
1.94, Cower cannot reject the null hypothesis. Cower should conclude that the 
earnings variances of the industries are not significantly different from one 
another at a 5% level of significance. 


LOS 8.c: Compare and contrast parametric and nonparametric tests, and 
describe situations where each is the more appropriate type of test. 


Parametric tests rely on assumptions regarding the distribution of the population 
and are specific to population parameters. For example, the z-test relies upon a mean 
and a standard deviation to define the normal distribution. The z-test also requires 
that either the sample is large, relying on the central limit theorem to assure a 
normal sampling distribution, or that the population is normally distributed. 


Nonparametric tests either do not consider a particular population parameter or 
have few assumptions about the population that is sampled. Nonparametric tests are 
used when there is concern about quantities other than the parameters of a 
distribution, or when the assumptions of parametric tests can’t be supported. They 
are also used when the data are not suitable for parametric tests (e.g., ranked 
observations). 


Some situations where a nonparametric test is called for may include the following: 


1. The assumptions about the distribution of the random variable that support a 
parametric test are not met. An example would be a hypothesis test of the mean 
value for a variable that comes from a distribution that is not normal and is of 
small size, so that neither the t-test nor the z-test is appropriate. 


2. A nonparametric test is called for when data are ranks (an ordinal measurement 
scale) rather than values. 


3. The hypothesis does not involve the parameters of the distribution, such as testing 
whether a variable is normally distributed. We can use a nonparametric test, 
called a runs test, to determine whether data are random. A runs test provides an 
estimate of the probability that a series of changes (e.g, +, +, -, -, +, -,.-..) are 
random. 


=» MODULE QUIZ 8.2 
1. Which of the following assumptions is least likely required for the difference in 
means test based on two samples? 
A. The two samples are independent. 
B. The two populations are normally distributed. 


C. The two populations have known variances. 


2. The appropriate test statistic for a test of the equality of variances for two 
normally distributed random variables, based on two independent random 
samples, is the: 

A. t-test. 
B. F-test. 


C. x2 test. 


3. The appropriate test statistic to test the hypothesis that the variance of a 
normally distributed population is equal to 13 is the: 
A. t-test. 
B. F-test. 


C. x2 test. 


KEY CONCEPTS 


LOS 8.a 

The hypothesis testing process requires a statement of a null and an alternative 
hypothesis, the selection of the appropriate test statistic, specification of the 
significance level, a decision rule, the calculation of a sample statistic, a decision 
regarding the hypotheses based on the test, and a decision based on the test results. 


The null hypothesis is what the researcher wants to reject. The alternative 
hypothesis is what the researcher wants to support, and it is accepted when the null 
hypothesis is rejected. 


A Type! error is the rejection of the null hypothesis when it is actually true, while a 
Type II error is the failure to reject the null hypothesis when it is actually false. 


The significance level can be interpreted as the probability that a test statistic will 
reject the null hypothesis by chance when it is actually true (i-e., the probability of a 
Type I error). A significance level must be specified to select the critical values for 
the test. 


The power of a test is the probability of rejecting the null when it is false. The power 
of atest = 1 - P(Type II error). 


The p-value for a hypothesis test is the smallest significance level for which the 
hypothesis would be rejected. 


LOS 8.b 
With degrees of 
Hypothesis tests of: Use a: freedom: 
One population mean t-statistic n-1 
Two population means t-statistic n-1 


One population variance § Chi-square statistic n-1 


Two population variances Fstatistic n, —1,n,-1 


LOS 8.e 


Parametric tests, like the t-test, F-test, and chi-square test, make assumptions 
regarding the distribution of the population from which samples are drawn. 
Nonparametric tests either do not consider a particular population parameter or 
have few assumptions about the sampled population. Nonparametric tests are used 
when the assumptions of parametric tests can’t be supported, or when the data are 
not suitable for parametric tests. 


ANSWER KEY FOR MODULE QUIZZES 


Module Quiz 8.1 


1.C A Type! error is rejecting the null hypothesis when it is true. The probability of 
rejecting a false null is [1 - Prob Type II] = [1 — 0.60] = 40%, which is called the 
power of the test. The other answer choices are not necessarily true, because 
the null may be false and the probability of rejection unknown. (LOS 8.a) 


2.A The power ofa test is 1 - P(Type II error) = 1 - 0.15 = 0.85. (LOS 8.a) 


Module Quiz 8.2 


1.C The difference in means test does not require the two population variances to 
be known. (LOS 8.b) 


2.B The F-test is the appropriate test. (LOS 8.b) 


3.C A test of the population variance is a chi-square test. (LOS 8.b) 


READING 9 


PARAMETRIC AND NON- 
PARAMETRIC TESTS OF 
INDEPENDENCE 


MODULE 9.1: TESTS FOR Cr) 
INDEPENDENCE 


Video covering 
this content is 


LOS 9.a: Explain parametric and nonparametric tests of the available online. 
hypothesis that the population correlation coefficient equals 

zero, and determine whether the hypothesis is rejected at a given level of 
significance. 


Correlation measures the strength of the relationship between two variables. If the 
correlation between two variables is zero, there is no linear relationship between 
them. When the sample correlation coefficient for two variables is different from 
zero, we must address the question of whether the true population correlation 
coefficient (p) is equal to zero. The appropriate test statistic for the hypothesis that 
the population correlation equals zero, when the two variables are normally 
distributed, is as follows: 

rvn—2 

vi —r* 


where: 

r = sample correlation 

n = sample size 
This test statistic follows a t-distribution with n — 2 degrees of freedom. Note that 
the test statistic increases, not only with the sample correlation coefficient, but also 
with sample size. 


EXAMPLE: Test of the hypothesis that the population correlation coefficient 
equals zero 


A researcher computes the sample correlation coefficient for two normally 
distributed random variables as 0.35, based on a sample size of 42. Determine 
whether to reject the hypothesis that the population correlation coefficient is 
equal to zero at a 5% significance level. 


Answer: 


- 4: = 0.35 V42—2 
Our test statistic is aes 2.363. 
Using the t-table with 42 — 2 = 40 degrees of freedom for a two-tailed test anda 
significance level of 5%, we can find the critical value of 2.021. Because our 
computed test statistic of 2.363 is greater than 2.021, we reject the hypothesis 
that the population mean is zero and conclude that it is not equal to zero. That is, 
the two populations are correlated—in this case, positively. 


PROFESSOR’S NOTE 

The correlation coefficient we refer to here is the Pearson correlation 
coefficient, which is a measure of the linear relationship between two 
variables. There are other correlation coefficients that better measure 
the strength of any nonlinear relationship between two variables. 


The Spearman rank correlation test, a nonparametric test, can be used to test 
whether two sets of ranks are correlated. Ranks are simply ordered values. If there is 
a tie (equal values), the ranks are shared—so if second and third rank is the same, 
the ranks are shared, and each gets a rank of (2 + 3) / 2 =2.5. 


The Spearman rank correlation, r, (when all ranks are integer values), is calculated 
as follows: 


a r 
op a: 
—_—— 
° n(n*—1) 
where: 
rs = rank correlation 
n = sample size 
dj = difference between two ranks 
We can test the significance of the Spearman rank correlation calculated with the 
formula just listed using the same test statistic we used for estimating the 
significance of a parametric correlation coefficient: 


j 
in—2 
tT. Vn—z 


\l-; 
When the sample size is greater than 30, the test statistic follows a t-distribution 
with n - 2 degrees of freedom. 


LOS 9.b: Explain tests of independence based on contingency table data. 


A contingency or two-way table shows the number of observations from a sample 
that have a combination of two characteristics. Figure 9.1 is a contingency table 
where the characteristics are earnings growth (low, medium, or high) and dividend 
yield (low, medium, or high). We can use the data in the table to test the hypothesis 
that the two characteristics, earnings growth and dividend yield, are independent of 
each other. 


Figure 9.1: Contingency Table for Categorical Data 


Earnings Growth Dividend Yield 


Low [Medinm [High [Total 
Low 
Meda 
High [| | 25 [a | ae | 
Total [119 | 110_| 95 | 328 | 


We index our three categories of earnings growth from low to high with i = 1, 2, or 3, 
and our three categories of dividend yield from low to high with j = 1, 2, or 3. From 
the table, we see in Cell 1,1 that 28 firms have both low earnings growth and low 
dividend yield. We see in Cell 3,2 that 25 firms have high earnings growth and 
medium dividend yields. 


For our test, we are going to compare the actual table values to what the values 
would be if the two characteristics were independent. The test statistic is a chi- 
square test statistic calculated as follows: 


oo Spa Sy" 
Ai OE 


where: 

Oj; = the number of observations in Cell i,j: Row i and Column j (i.e., observed frequency) 
Ejj = the expected number of observations for Cell i,j 

r = the number of row categories 

c = the number of column categories 


The degrees of freedom are (r — 1) x (c - 1), which is 4 in our example for dividend 
yield and earnings growth. 
Eip the expected number of observations in Cell i,j, is: 


E., the expected number of observations in Cell i,j, is: 
total for Row i x total for Column j 
total for all columns and rows 
The expected number of observations for Cell 2,2 is: 


110 x 113 —_ 38.4 


324 
In calculating our test statistic, the term for Cell 2,2 is: 
(32—38.4)? 
“334 > 1.067 


Figure 9.2 shows the expected frequencies for each pair of categories in our earnings 
growth and dividend yield contingency table. 


Figure 9.2: Contingency Table for Expected Frequencies 


Earnings Growth Dividend Yield 
High 


tow | aes [Hi 
a Ee 


For our test statistic, we sum, for all nine cells, the squared difference between the 
expected frequency and observed frequency, divided by the expected frequency. The 
resulting sum is 27.43. 


Our degrees of freedom are (3 - 1) x (3 - 1) = 4. The critical value for a significance 
level of 5% (from the chi-square table in the Appendix) with 4 degrees of freedom is 
9.488. Based on our sample data, we can reject the hypothesis that the earnings 
growth and dividend yield categories are independent. 


“—) MODULE QUIZ 9.1 
—_ 
1. The test statistic for a Spearman rank correlation test for a sample size greater 
than 30 follows a: 
A. t-distribution. 
B. normal distribution. 
C. chi-square distribution. 


2. A contingency table can be used to test: 
A. a null hypothesis that rank correlations are equal to zero. 
B. whether multiple characteristics of a population are independent. 
C. the number of p-values from multiple tests that are less than adjusted critical 
values. 


3. For a parametric test of whether a correlation coefficient is equal to zero, it is least 
likely that: 
A. degrees of freedom aren - 1. 
B. the test statistic follows a t-distribution. 
C. the test statistic increases with a greater sample size. 


KEY CONCEPTS 


LOS 9.a 


To test a hypothesis that a population correlation coefficient equals zero, the 


appropriate test statistic is a t-statistic with n — 2 degrees of freedom, calculated as 
— where r is the sample correlation coefficient. 
V1—r? 


A nonparametric test of correlation can be performed when we have only ranks (e.g, 
deciles of investment performance). The Spearman rank correlation test examines 
whether the ranks for multiple periods are correlated. The rank correlation is 


Oe 
_ i 
n(n? —1) 
nis the number of sample periods. The test statistic follows a t-distribution for 
samples sizes greater than 30. 


_ , where d? is the sum of the squared differences in pairs of ranks and 


LOS 9.b 


A contingency table can be used to test the hypothesis that two characteristics 
(categories) of a sample of items are independent. A contingency table shows the 
number of the sample items (e.g., firms that have both of two characteristics). The 
test statistic follows a chi-square distribution and is calculated as follows: 


O; — the number of observations in Cell i,j: Row ¢ and Column j(i.e., observed 


frequency) 

E; _. the expected number of observations for Cell i,j of the contingency table 
with independence 

r  . the number of row categories 

c = the number of column categories 


The degrees of freedom are (r — 1) x (c - 1). If the test statistic is greater than the 
critical chi-square value for a given level of significance, we reject the hypothesis 
that the two characteristics are independent. 


ANSWER KEY FOR MODULE QUIZZES 


Module Quiz 9.1 


1.A The test statistic for a Spearman rank correlation test follows a t-distribution. 
(LOS 9.a) 


2.B A contingency table is used to determine whether two characteristics of a 
group are independent. (LOS 9.b) 


3.A Degrees of freedom are n - 2 for a test of the hypothesis that correlation is 
equal to zero. The test statistic increases with sample size (degrees of freedom 
increase) and follows a t-distribution. (LOS 9.a) 
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LOS 10.a: Describe a simple linear regression model, how the —_2vailable online. 
least squares criterion is used to estimate regression 
coefficients, and the interpretation of these coefficients. 


The purpose of simple linear regression is to explain the variation in a dependent 
variable in terms of the variation in a single independent variable. Here, the term 
variation is interpreted as the degree to which a variable differs from its mean value. 
Don't confuse variation with variance—they are related, but they are not the same. 


2. = 
variation in Y = 2 Y;-Y) 


« The dependent variable is the variable whose variation is explained by the 
independent variable. We are interested in answering the question, “What explains 
fluctuations in the dependent variable?” The dependent variable is also referred to 
as the terms explained variable, endogenous variable, or predicted variable. 


» The independent variable is the variable used to explain the variation of the 
dependent variable. The independent variable is also referred to as the terms 
explanatory variable, exogenous variable, or predicting variable. 


EXAMPLE: Dependent vs. independent variables 


Suppose you want to predict stock returns with GDP growth. Which variable is 
the independent variable? 


Answer: 


Because GDP is going to be used as a predictor of stock returns, stock returns are 
being explained by GDP. Hence, stock returns are the dependent (explained) 
variable, and GDP is the independent (explanatory) variable. 


Suppose we want to use excess returns on the S&P 500 (the independent variable) to 
explain the variation in excess returns on ABC common stock (the dependent 
variable). For this model, we define excess return as the difference between the 
actual return and the return on 1-month Treasury bills. 


We would start by creating a scatter plot with ABC excess returns on the vertical 
axis and S&P 500 excess returns on the horizontal axis. Monthly excess returns for 
both variables from June 20X2 to May 20X5 are plotted in Figure 10.1. For example, 
look at the point labeled May 20X4. In that month, the excess return on the S&P 500 
was —7.8%, and the excess return on ABC was 1.1%. 


Figure 10.1: Scatter Plot of ABC Excess Returns vs. S&P 500 Index Excess Returns 


May 20x4 
ABC excess return = 1.1% 
S&P 500 excess return = -7.8% 


S&P 500 


excess _7150% 3% 
returns . ee 


-12% 


ABC 
excess returns 


The two variables in Figure 10.1 appear to be positively correlated: excess ABC 
returns tended to be positive (negative) in the same month that S&P 500 excess 
returns were positive (negative). This is not the case for all the observations, 
however (for example, May 20X4). In fact, the correlation between these variables is 
approximately 0.40. 


Simple Linear Regression Model 


The following linear regression model is used to describe the relationship between 
two variables, X and Y: 


Y; = bp + bX; +, ..1=1,..,0 
where: 


Y, = fth observation of the dependent variable, Y 

X; = ith observation of the independent variable, X 

bp = regression intercept term 

b, = regression slope coefficient 

¢; = residual for the ‘th observation (also referred to as the disturbance term or 
error term) 


Based on this regression model, the regression process estimates an equation for a 
line through a scatter plot of the data that “best” explains the observed values for Y 
in terms of the observed values for X. 


The linear equation, often called the line of best fit or regression line, takes the 
following form: 


Y;=b)+ b,X;,i1=1,2,3...,.0 
where: 

2. estimated value of Y; given X; 
bo estimated intercept term 


d, estimated slope coefficient 


PROFESSOR’S NOTE 
The hat «~» above a variable or parameter indicates a predicted value. 


The regression line is just one of the many possible lines that can be drawn through 
the scatter plot of X and Y. The criteria used to estimate this line is the essence of 
linear regression. The regression line is the line that minimizes the sum of the 
squared differences (vertical distances) between the Y-values predicted by the 


regression equation (Y, = 6, + 6, X,) and the actual Y-values, Y;. The sum of the 


squared vertical distances between the estimated and actual Y-values is referred to 
as the sum of squared errors (SSE). 


Thus, the regression line is the line that minimizes the SSE. This explains why simple 
linear regression is frequently referred to as ordinary least squares (OLS) 
regression, and the values determined by the estimated regression equation, ¥, are 
called least squares estimates. 


The estimated slope coefficient (5 ,) for the regression line describes the change in 
Y for a one-unit change in X. It can be positive, negative, or zero, depending on the 
relationship between the regression variables. The slope term is calculated as 
follows: 


~ Cov 


oa 
x 

The intercept term (b,) is the line’s intersection with the Y-axis at X = 0. It can be 

positive, negative, or zero. A property of the least squares method is that the 

intercept term may be expressed as follows: 


where: 

Y = mean of Y 

X = mean of X 
The intercept equation highlights the fact that the regression line passes through a 
point with coordinates equal to the mean of the independent and dependent 
variables (i.e., the point X, Y). 


EXAMPLE: Computing the slope coefficient and intercept term 


Compute the slope coefficient and intercept term using the following information: 
Cov(S&P 500, ABC) = 0.000336 Mean return, S&P 500 = —2.70% 
Var(S&P 500) = 0.000522 Mean return, ABC = —4.05% 


Answer: 
The slope coefficient is calculated as 6, = 0.000336 / 0.000522 = 0.64.. 


The intercept term is calculated as follows: 


by = ABC — 6, S&P 500 = — 4.05% — 0.64(—2.70%) = —2.3% 


The estimated regression line that minimizes the SSE in our ABC stock return 
example is shown in Figure 10.2. 


This regression line has an intercept of -2.3% anda slope of 0.64. The model 
predicts that if the S&P 500 excess return is —7.8% (May 20X4 value), then the ABC 
excess return would be —2.3% + (0.64)(-—7.8%) = -7.3%. The residual (error) for the 
May 20X4 ABC prediction is 8.4%—the difference between the actual ABC excess 
return of 1.1% and the predicted return of -7.3%. 


Figure 10.2: Estimated Regression Equation for ABC vs. S&P 500 Excess Returns 
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Interpreting a Regression Coefficient 


The estimated intercept represents the value of the dependent variable at the point 
of intersection of the regression line and the axis of the dependent variable (usually, 
the vertical axis). In other words, the intercept is an estimate of the dependent 
variable when the independent variable is zero. 


We also mentioned earlier that the estimated slope coefficient is interpreted as the 
expected change in the dependent variable for a one-unit change in the independent 
variable. For example, an estimated slope coefficient of 2 would indicate that the 
dependent variable is expected to change by two units for every one-unit change in 
the independent variable. 


EXAMPLE: Interpreting regression coefficients 


In the previous example, the estimated slope coefficient was 0.64 and the 
estimated intercept term was —2.3%. Interpret each coefficient estimate. 


Answer: 


The slope coefficient of 0.64 can be interpreted to mean that when excess S&P 
500 returns increase (decrease) by 1%, ABC excess returns is expected to increase 
(decrease) by 0.64%. 


The intercept term of —2.3% can be interpreted to mean that when the excess 
return on the S&P 500 is zero, the expected return on ABC stock is -2.3%. 


PROFESSOR’S NOTE 

The slope coefficient in a regression of the excess returns of an individual 
security (the y-variable) on the return on the market (the x-variable) is 
called the stock’s beta, which is an estimate of systematic risk of ABC 
stock. Notice that ABC is less risky than the average stock, because its 
returns tend to increase or decrease by less than the overall change in the 
market returns. A stock with a beta (regression slope coefficient) of 1 has 
an average level of systematic risk, and a stock with a beta greater than 1 
has more-than-average systematic risk. We will apply this concept in the 
Portfolio Management topic area. 


Keep in mind, however, that any conclusions regarding the importance of an 
independent variable in explaining a dependent variable are based on the statistical 
significance of the slope coefficient. The magnitude of the slope coefficient tells us 
nothing about the strength of the linear relationship between the dependent and 
independent variables. A hypothesis test must be conducted, or a confidence interval 
must be formed, to assess the explanatory power of the independent variable. Later 
in this reading we will perform these hypothesis tests. 


LOS 10.b: Explain the assumptions underlying the simple linear regression 
model, and describe how residuals and residual plots indicate if these 
assumptions may have been violated. 


Linear regression is based on numerous assumptions. Most of the major assumptions 
pertain to the regression model’s residual term (¢). Linear regression assumes the 
following: 


1. A linear relationship exists between the dependent and the independent variables. 


2. The variance of the residual term is constant for all observations 
(homoskedasticity). 


3. The residual term is independently distributed; that is, the residual for one 
observation is not correlated with that of another observation (or, the paired x and 
y observations are independent of each other). 


4. The residual term is normally distributed. 


Linear Relationship 


A linear regression model is not appropriate when the underlying relationship 
between X and Y is nonlinear. In Panel A of Figure 10.3, we illustrate a regression line 
fitted to a nonlinear relationship. Note that the prediction errors (vertical distances 
from the dots to the line) are positive for low values of X, then increasingly negative 
for higher values of X, and then turning positive for still-greater values of X. One way 
of checking for linearity is to examine the model residuals (prediction errors) in 
relation to the independent regression variable. In Panel B, we show the pattern of 


residuals over the range of the independent variable: positive, negative, then 
positive. 


Figure 10.3: Nonlinear Relationship 


a. Fitted vs. Observed 
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Homoskedasticity 


Homoskedasticity refers to the case where prediction errors all have the same 
variance. Heteroskedasticity refers to the situation when the assumption of 
homoskedasticity is violated. Figure 10.4, Panel A shows a scatter plot of 
observations around a fitted regression line where the residuals (prediction errors) 
increase in magnitude with larger values of the independent variable X. Panel B 
shows the residuals plotted versus the value of the independent variable, and it also 
illustrates that the variance of the error terms is not likely constant for all 
observations. 


Figure 10.4: Heteroskedasticity 


a. Fitted vs. Observed 
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b. Residuals vs. Independent Variable 
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Another type of heteroskedasticity results if the variance of the error term changes 
over time (rather than with the magnitude of the independent variable). We could 
observe this by plotting the residuals from a linear regression model versus the 
dates of each observation and finding that the magnitude of the errors exhibits a 
pattern of changing over time. To illustrate this, we could plot the residuals versus a 
time index (as the x-variable). Residuals would exhibit a pattern of increasing over 
time. 


Independence 


Suppose we collect a company’s monthly sales and plot them against monthly GDP 
as in Figure 10.5, Panel A, and observe that some prediction errors (the unfilled 
dots) are noticeably larger than others. To investigate this, we plot the residuals 
versus time, as in Panel B. The residuals plot illustrates that there are large negative 
prediction errors every 12 months (in December). This suggests that there is 
seasonality in sales such that December sales (the unfilled dots in Figure 10.5) are 
noticeably farther from their predicted values than sales for the other months. If the 
relationship between X and Y is not independent, the residuals are not independent, 
and our estimates of the model parameters’ variances will not be correct. 


Figure 10.5: Independence 


a. Sales vs. GDP 
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Normality 


When the residuals (prediction errors) are normally distributed, we can conduct 
hypothesis testing for evaluating the goodness of fit of the model (discussed later). 
With a large sample size, based on the central limit theorem, our parameter 
estimates may be valid, even when the residuals are not normally distributed. 


Outliers are observations (one or a few) that are far from our regression line (have 
large prediction errors or X values that are far from the others). Outliers will 
influence our parameter estimates so that the OLS model will not fit the other 
observations well. 


—» MODULE QUIZ 10.1 


1. What is the most appropriate interpretation of a slope coefficient estimate equal to 

10.0? 

A. The predicted value of the dependent variable when the independent variable is 
0 is 10.0. 

B. For every 1-unit change in the independent variable, the model predicts that 
the dependent variable will change by 10 units. 

C. For every 1-unit change in the independent variable, the model predicts that 
the dependent variable will change by 0.1 units. 


2. Which of the following is least likely a necessary assumption of simple linear 
regression analysis? 
A. The residuals are normally distributed. 
B. There is a constant variance of the error term. 
C. The dependent variable is uncorrelated with the residuals. 


MODULE 10.2: ANALYSIS OF VARIANCE Cc) 
(ANOVA) AND GOODNESS OF FIT 


Video covering 
LOS 10.c: Calculate and interpret measures of fit and formulate and evaluate _ this content is 
tests of fit and of regression coefficients in a simple linear regression. available online. 


LOS 10.d: Describe the use of analysis of variance (ANOVA) in regression 
analysis, interpret ANOVA results, and calculate and interpret the standard error of estimate 
in a simple linear regression. 


Analysis of variance (ANOVA) is a statistical procedure for analyzing the total 
variability of the dependent variable. Let’s define some terms before we move on to 
ANOVA tables: 


= The total sum of squares (SST) measures the total variation in the dependent 
variable. SST is equal to the sum of the squared differences between the actual Y- 
values and the mean of Y: 


SST 2 ae sy 


» The sum of squares regression (SSR) measures the variation in the dependent 
variable that is explained by the independent variable. SSR is the sum of the 
squared distances between the predicted Y-values and the mean of Y: 


SSR 2 -Y) 
1 


» The mean square regression (MSR) is the SSR divided by the number of 
independent variables. A simple linear regression has only one independent 
variable, so in this case, MSR = SSR. 


PROFESSOR’S NOTE 
Multiple regression (i.e., with more than one independent variable) is 
addressed in the Level II CFA curriculum. 


=» The sum of squared errors (SSE) measures the unexplained variation in the 
dependent variable. It’s also known as the sum of squared residuals or the residual 
sum of squares. SSE is the sum of the squared vertical distances between the 
actual Y-values and the predicted Y-values on the regression line: 


SSE 2 ¥:—¥) 
L 


» The mean squared error (MSE) is the SSE divided by the degrees of freedom, 
which is n - 1 minus the number of independent variables. A simple linear 
regression has only one independent variable, so in this case, degrees of freedom 
aren -— 2. 


You probably will not be surprised to learn the following: 


total variation = explained variation + unexplained variation 
or: 
SST = SSR + SSE 


Figure 10.6 illustrates how the total variation in the dependent variable (SST) is 
composed of SSR and SSE. 


Figure 10.6: Components of Total Variation 
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The output of the ANOVA procedure is an ANOVA table, which is a summary of the 
variation in the dependent variable. ANOVA tables are included in the regression 
output of many statistical software packages. You can think of the ANOVA table as 
the source of the data for the computation of many of the regression concepts 
discussed in this reading. A generic ANOVA table for a simple linear regression (one 
independent variable) is presented in Figure 10.7. 


Figure 10.7; ANOVA Table for a Simple Linear Regression 


Source of Degrees of Sum of Mean Sum of 
Variation Freedom Squares Squares 
Regression (explained) 1 SSR MSR Sok SSR SSR 
Error (unexplained) n—2 SSE MSE SSE. 
Total n-1 SST 


Standard Error of Estimate (SEE) 


The SEE for a regression is the standard deviation of its residuals. The lower the SEE, 
the better the model fit: 


SEE = VMSE 


Coefficient of Determination (R2) 
The coefficient of determination (R’) is defined as the percentage of the total 
variation in the dependent variable explained by the independent variable. For 


example, an R* of 0.63 indicates that the variation of the independent variable 
explains 63% of the variation in the dependent variable: 


R2 = SSR / SST 


PROFESSOR’S NOTE 

For simple linear regression (i.e. with one independent variable), the 
coefficient of determination, R?, may be computed by simply squaring 
the correlation coefficient, r. In other words, R2 = r* fora regression with 
one independent variable. 


EXAMPLE: Using the ANOVA table 


Given the following ANOVA table based on 36 observations, calculate the R? and 
the standard error of estimate (SEE). 


Completed ANOVA table for ABC regression 


Source of Degrees of Sum of Mean Sum of 
Variation Freedom Squares Squares 
Regression (explained) 1 0.0076 0.0076 
Error (unexplained) 34 0.0406 0.0012 
Total 35 0.0482 
Answer: 


> __ explained variation(SSR) —_ 0.0076 
‘total variation (SST) —-0.0482 


SEE = / MSE = 70.0012 = 0.035 


= 0.158 or 15.8% 


The F-Statistic 


An F-test assesses how well a set of independent variables, as a group, explains the 
variation in the dependent variable. 


The F-statistic is calculated as follows: 


F: MSR SSR/k 
MSE SSE/n-k-1 
where: 


MSR = mean regression sum of squares 

MSE = mean squared error 

Important: This is always a one-tailed test! 
For simple linear regression, there is only one independent variable, so the F-test is 
equivalent to a t-test of the statistical significance of the slope coefficient: 

Hg: by = 0 versus Hy: by # 0 
To determine whether b, is statistically significant using the F-test, the calculated F- 
statistic is compared with the critical F-value, F,, at the appropriate level of 
significance. The degrees of freedom for the numerator and denominator with one 
independent variable are as follows: 

dfnumerator = k= 1 

dfdenominator = - k-1=n-2 

where: 

n = number of observations 


The decision rule for the F-test is to reject Hg if F > F.. 


Rejecting the null hypothesis that the value of the slope coefficient equals zero at a 
stated level of significance indicates that the independent variable and the 
dependent variable have a significant linear relationship. 


EXAMPLE: Calculating and interpreting the F-statistic 


Use the ANOVA table from the previous example to calculate and interpret the F- 
statistic. Test the null hypothesis at the 5% significance level that the slope 
coefficient is equal to 0. 


Answer: 


MSR 0.0076, 
= MSE ~ 0.0012 °°? 


Ol neiretion =k=1 
Gigenominerer =n-k-1=36-1-1=34 


The null and alternative hypotheses are hy: b, = 0 versus h,: b; # 0. The critical F- 


value for 1 and 34 degrees of freedom at a 5% significance level is approximately 
4.1. (Remember, it’s a one-tailed test, so we use the 5% F-table.) Therefore, we can 
reject the null hypothesis and conclude that the slope coefficient is significantly 
different than zero. 


Hypothesis Test of a Regression Coefficient 
A t-test may also be used to test the hypothesis that the true slope coefficient, b,, is 
equal to a hypothesized value. Letting b, be the point estimate for bj, the 
appropriate test statistic with n - 2 degrees of freedom is: 
= b,—b; 
bj Sh, 


The decision rule for tests of significance for regression coefficients is: 
Reject Hy ift>+ loritical ort< teritical 
Rejection of the null supports the alternative hypothesis that the slope coefficient is 


different from the hypothesized value of b;. To test whether an independent variable 


explains the variation in the dependent variable (i.e., it is statistically significant), 
the null hypothesis is that the true slope is zero (b, = 0). The appropriate test 


structure for the null and alternative hypotheses is: 


Hp: b; = 0 versus H;: b; +0 


EXAMPLE: Hypothesis test for significance of regression coefficients 


The estimated slope coefficient from the ABC example is 0.64 with a standard 
error equal to 0.26. Assuming that the sample has 36 observations, determine if 
the estimated slope coefficient is significantly different than zero at a 5% level of 
significance. 


Answer: 
The calculated test statistic is: 
= b,-by _ 0.64-0 
76 0.26 
The critical two-tailed t-values are +2.03 (from the t-table with df = 36 - 2 = 34). 


Because t > teritical (i-€., 2-46 > 2.03), we reject the null hypothesis and conclude 
that the slope is different from zero. 


= 2.46 


Note that the t-test for a simple linear regression is equivalent to a t-test for the 
correlation coefficient between x and y: 


_ ryn-2 


i V1—-r2 


MODULE QUIZ 10.2 


= 1. Consider the following statement: “In a simple linear regression, the appropriate 
—P degrees of freedom for the critical t-value used to calculate a confidence interval 
around both a parameter estimate and a predicted Y-value is the same as the 
number of observations minus two.” This statement is: 
A. justified. 
B. not justified, because the appropriate degrees of freedom used to calculate a 
confidence interval around a parameter estimate is the number of observations. 
C. not justified, because the appropriate degrees of freedom used to calculate a 
confidence interval around a predicted Y-value is the number of observations. 


2. What is the appropriate alternative hypothesis to test the statistical significance 
of the intercept term in the following regression? 


Y= a, + a{X) + 
A. H,: a, = 0. 
B. H,: a, > 0. 
C. H,g:a, = 0. 


3. The variation in the dependent variable explained by the independent variable is 
measured by the: 
A. mean squared error. 
B. sum of squared errors. 
C. regression sum of squares. 


MODULE 10.3: PREDICTED VALUES AND 
FUNCTIONAL FORMS OF REGRESSION 


Video covering 
this content is 
available online. 


LOS 10.e: Calculate and interpret the predicted value for the 
dependent variable, and a prediction interval for it, given an 
estimated linear regression model and a value for the independent variable. 


Predicted values are values of the dependent variable based on the estimated 
regression coefficients and a prediction about the value of the independent variable. 
They are the values that are predicted by the regression equation, given an estimate 
of the independent variable. 


For a simple regression, this is the predicted (or forecast) value of Y: 
¥ = by + b,X, 
where: 


Y predicted value of the dependent variable 
X, forecasted value of the independent variable 


EXAMPLE: Predicting the dependent variable 
Given the ABC regression equation as follows: 
ABC = —2.3% + (0.64)(S&P 500) 


Calculate the predicted value of ABC excess returns if forecast S&P 500 excess 
returns are 10%. 


Answer: 
The predicted value for ABC excess returns is determined as follows: 
ABC = —2.3% + (0.64)(10%)= 4.1% 


Confidence Intervals for Predicted Values 

This is the equation for the confidence interval for a predicted value of Y: 
Y+(t,xs)>[Y—-(t.xs)<¥<Y+(t.xs)] 
where: 


tc = two-tailed critical t-value at the desired level of significance with df =n - 2 
sf = Standard error of the forecast 


The challenge with computing a confidence interval for a predicted value is 
calculating s,. On the Level I exam, it’s highly unlikely that you will have to calculate 
the standard error of the forecast (it will probably be provided if you need to 
compute a confidence interval for the dependent variable). However, if you do need 
to calculate s,, it can be done with the following formula for the variance of the 


forecast: 
s? = SEE?}1+1, K-X" 
(a — Ds 
where: 
SEE? = variance of the residuals = the square of the standard error of estimate 
= variance of the independent variable 
xX value of the independent variable for which the forecast was made 


EXAMPLE: Confidence interval for a predicted value 


Calculate a 95% prediction interval on the predicted value of ABC excess returns 
from the previous example. Assume the standard error of the forecast is 3.67, and 
the forecast value of S&P 500 excess returns is 10%. 


Answer: 
This is the predicted value for ABC excess returns: 
ABC = —2.3% + (0.64)(10%) = 4.1% 


The 5% two-tailed critical t-value with 34 degrees of freedom is 2.03. This is the 
prediction interval at the 95% confidence level: 


ABC + (t_ x s,) => [4.1% + (2.03 x 3.67%)] = 4.1% + 7.5% 
Or, -3.4% to 11.6%. 


We can interpret this range to mean that, given a forecast value for S&P 500 
excess returns of 10%, we can be 95% confident that the ABC excess returns will 
be between -3.4% and 11.6%. 


LOS 10.f: Describe different functional forms of simple linear regressions. 


One of the assumptions of linear regression is that the relationship between X and Y 
is linear. What if that assumption is violated? Consider Y = EPS for a company and X 
= time index. Suppose that EPS is growing at approximately 10% annually. Figure 
10.8 shows the plot of actual EPS versus time. 


Figure 10.8: Nonlinear Relationship 
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In such a situation, transforming one or both of the variables can produce a linear 
relationship. The appropriate transformation depends on the relationship between 
the two variables. One often-used transformation is to take the natural log of one or 
both of the variables. Here are some examples: 


» Log-lin model. This is if the dependent variable is logarithmic, while the 
independent variable is linear. 


» Lin-log model. This is if the dependent variable is linear, while the independent 
variable is logarithmic. 


= Log-log model. Both the dependent variable and the independent variable are 
logarithmic. 


Selecting the correct functional form involves determining the nature of the 
variables and evaluating the goodness-of-fit measures (e.g., R*, SEE, F-stat). 


Log-Lin Model 


Taking the natural logarithm of the dependent variable, our model now becomes 
this: 


In Yj = bo + by Xj + & 


In this model, the slope coefficient is interpreted as the relative change in dependent 
variable for an absolute change in the independent variable. Figure 10.9 shows the 
results after taking the natural log of EPS, and fitting that data using a log-lin model. 


Figure 10.9: Log-Lin Model, EPS Data 
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Lin-Log Model 


Taking the natural logarithm of the independent variable, our model now becomes 
this: 


Yi = bo + byIn(X); + & 


In this model, the slope coefficient is interpreted as the absolute change in 
dependent variable for a relative change in the independent variable. 


Log-Log Model 


Taking the natural logarithm of both variables, our model now becomes this: 
In Yj = bo + byIn(X); + & 


In this model, the slope coefficient is interpreted as the relative change in dependent 
variable for a relative change in the independent variable. 


=I MODULE QUIZ 10.3 
he 


1. For a regression model of Y = 5 + 3.5X, the analysis (based on a large data sample) 
provides the standard error of the forecast as 2.5 and the standard error of the 
slope coefficient as 0.8. A 90% confidence interval for the estimate of Y when the 
value of the independent variable is 10 is closest to: 

A. 35.1 to 44.9. 
B. 35.6 to 44.4. 
C. 35.9 to 44.1. 


2. The appropriate regression model for a linear relationship between the relative 
change in an independent variable and the absolute change in the dependent 
variable is a: 

A. log-lin model. 
B. lin-log model. 
C. lin-lin model. 


KEY CONCEPTS 


LOS 10.a 

Linear regression provides an estimate of the linear relationship between an 
independent variable (the explanatory variable) and a dependent variable (the 
predicted variable). 


The general form of a simple linear regression model is as follows: 
ae - bo + b1Xj + & 


The least squares model minimizes the sum of squared errors: 


by = fitted intercept = Y — 6, X 
b, fitted slope coefficient = Cov (X,Y) / variance of X 


The estimated intercept, b)>, represents the value of the dependent variable at the 
point of intersection of the regression line and the axis of the dependent variable 
(usually, the vertical axis). The estimated slope coefficient, db, is interpreted as the 
change in the dependent variable for a one-unit change in the independent variable. 


LOS 10.b 
Assumptions made regarding simple linear regression include the following: 


1. A linear relationship exists between the dependent and the independent variable. 
2. The variance of the residual term is constant (homoskedasticity). 
3. The residual term is independently distributed (residuals are uncorrelated). 


4. The residual term is normally distributed. 


LOS 10.c 

The total sum of squares (SST) measures the total variation in the dependent 
variable and equals the sum of the squared differences between its actual values and 
its mean. 


The sum of squares regression (SSR) measures the variation in the dependent 
variable that is explained by the independent variable, and equals the sum of the 
squared distances between the predicted values and the mean of the dependent 
variable. 


The mean square regression (MSR) is the SSR divided by the number of independent 
variables. For a simple linear regression, MSR = SSR. 


The sum of squared errors (SSE) measures the unexplained variation in the 
dependent variable and is the sum of the squared vertical distances between the 
actual and the predicted values of the dependent variable. 


The mean squared error (MSE) is the SSE divided by the degrees of freedom (n - 2 
for a simple linear regression). 


The coefficient of determination, R2, is the proportion of the total variation of the 
dependent variable explained by the regression: 

R2 — SSR _ SST — SSE 

SST SST 

In simple linear regression, because there is only one independent variable (k = 1), 
the F-test tests the same null hypothesis as testing the statistical significance of b; 
using the t-test: Hp: b; = 0 versus H,: b; # 0. With only one independent variable, F is 
calculated as follows: 


MSR 


F-stat MSE 


with 1 and n — 2 degrees of freedom 


LOS 10.d 
ANOVA Table for Simple Linear Regression (k = 1) 


Degrees of 
Source of Freedom Sum of Mean Sum of 
Variation (df) Squares Squares 
Regression (explained) 1 SSR MSR = 2k = 23k = SSR 
Error (unexplained) n-2 SSE MSE = 22k, 
Total n-1 SST 


The standard error of the estimate in a simple linear regression is calculated as 
follows: 


SEE — ,/-SSE_ 
n-—2 


LOS 10.e 


A predicted value of the dependent variable, ¥, is determined by inserting the 
predicted value of the independent variable, X,, in the regression equation and 


calculating Y, = by + b,X,. 


The confidence interval for a predicted Y-value is [y- (tex s9< Y¥<¥+(t, x s9| 
where S; is the standard error of the forecast. 


LOS 10.f 
Dependent Independent Slope 
Variable Variable Model Interpretation 

Logarithmic Linear Log-lin Relative change in dependent variable 
for an absolute change in the 
independent variable 

Linear Logarithmic Lin-log Absolute change in dependent variable 
for a relative change in the independent 
variable 


Logarithmic Logarithmic Log-log Relative change in dependent variable 
for a relative change in the independent 
variable 


ANSWER KEY FOR MODULE QUIZZES 


Module Quiz 10.1 


1.B The slope coefficient is best interpreted as the predicted change in the 
dependent variable for a 1-unit change in the independent variable; if the slope 
coefficient estimate is 10.0 and the independent variable changes by 1 unit, the 
dependent variable is expected to change by 10 units. The intercept term is 
best interpreted as the value of the dependent variable when the independent 
variable is equal to zero. (LOS 10.a) 


2.C The model does not assume that the dependent variable is uncorrelated with 
the residuals. It does assume that the independent variable is uncorrelated with 
the residuals. (LOS 10.b) 


Module Quiz 10.2 


1.A Insimple linear regression, the appropriate degrees of freedom for both 


confidence intervals is the number of observations in the sample (n) minus 
two. (LOS 10.c) 


2.A In this regression, a, is the intercept term. To test the statistical significance 
means to test the null hypothesis that a, is equal to zero, versus the alternative 
that a, is not equal to zero. (LOS 10.c) 


3.C The regression sum of squares measures the amount of variation in the 
dependent variable explained by the independent variable (i.e., the explained 
variation). The sum of squared errors measures the variation in the dependent 
variable not explained by the independent variable. The mean squared error is 
equal to the sum of squared errors divided by its degrees of freedom. (LOS 10.d) 


Module Quiz 10.3 


1.C The estimate of Y, given X = 10, is Y= 5 + 3.5(10) = 40. The critical value for a 
90% confidence interval with a large sample size (z-statistic) is approximately 
1.65. Given the standard error of the forecast of 2.5, the confidence interval for 
the estimated value of Y is 40 +/- 1.65(2.5) = 35.875 to 44.125. (LOS 10.e) 


2.B The appropriate model would be a lin-log model, in which the values of the 
dependent variable (Y ) are regressed on the natural logarithms of the 
independent variable (X ): Y = by + b, In(X). (LOS 10.f) 
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LOS 11.a: Describe aspects of “fintech” that are directly relevant @vailable online. 
for the gathering and analyzing of financial data. 


The term fintech refers to developments in technology that can be applied to the 
financial services industry. Companies that are in the business of developing 
technologies for the finance industry are often referred to as fintech companies. 


Some of the primary areas where fintech is developing include the following: 


» Increasing functionality to handle large sets of data that may come from many 
sources and exist in various forms 

» Tools and techniques such as artificial intelligence for analyzing very large 
datasets 


LOS 11.b: Describe Big Data, artificial intelligence, and machine learning. 


Big Data is a widely used expression that refers to all the potentially useful 
information that is generated in the economy. This includes not only data from 
traditional sources, such as financial markets, company financial reports, and 
government economic statistics, but also alternative data from nontraditional 
sources. Some of these nontraditional sources are as follows: 


« Individuals generate usable data such as social media posts, online reviews, email, 
and website visits. 

» Businesses generate potentially useful information such as bank records and retail 
scanner data. These kinds of data are referred to as corporate exhaust. 

» Sensors, such as radio frequency identification chips, are embedded in numerous 
devices such as smartphones and smart buildings. The broad network of such 
devices is referred to as the Internet of Things. 


Characteristics of Big Data include its volume, velocity, and variety. 


The volume of data continues to grow by orders of magnitude. The units in which 
data can be measured have increased from megabytes and gigabytes to terabytes 
(1,000 gigabytes) and even petabytes (1,000 terabytes). 


Velocity refers to how quickly data are communicated. Real-time data such as stock 
market price feeds are said to have low latency. Data that are only communicated 
periodically or with a lag are said to have high latency. 


The variety of data refers to the varying degrees of structure in which data may 
exist. These range from structured forms (e.g., spreadsheets and databases), to 
semistructured forms (e.g., photos and web page code), to unstructured forms (e.g, 
video). 


The field of data science concerns how we extract information from Big Data. Data 
science describes methods for processing and visualizing data. Processing methods 
include the following: 


» Capture. This is collecting data and transforming it into usable forms. 

= Curation. This is assuring data quality by adjusting for bad or missing data. 
» Storage. This is archiving and accessing data. 

» Search. This is examining stored data to find needed information. 


« Transfer. This is moving data from their source or a storage medium to where they 
are needed. 


Visualization techniques include the familiar charts and graphs that display 
structured data. To visualize less structured data requires other methods. Some 
examples of these are word clouds that illustrate the frequency with which words 
appear in a sample of text, or mind maps that display logical relations among 
concepts. 


Taking advantage of Big Data presents numerous challenges. Analysts must ensure 
that the data they use are of high quality, accounting for the possibilities of outliers, 
bad or missing data, or sampling biases. The volume of data collected must be 
sufficient and appropriate for its intended use. 


The need to process and organize data before using it can be especially problematic 
with qualitative and unstructured data. This is a process to which artificial 
intelligence, or computer systems that can be programmed to simulate human 
cognition, may be applied usefully. Neural networks are an example of artificial 
intelligence in that they are programmed to process information in a way similar to 
the human brain. 


An important development in the field of artificial intelligence is machine learning. 
In machine learning, a computer algorithm is given inputs of source data, with no 
assumptions about their probability distributions, and may be given outputs of 
target data. The algorithm is designed to learn, without human assistance, how to 
model the output data based on the input data, or to learn how to detect and 
recognize patterns in the input data. 


Machine learning typically requires vast amounts of data. A typical process begins 
with a training dataset in which the algorithm looks for relationships. A validation 


dataset is then used to refine these relationship models, which can then be applied 
to a test dataset to analyze their predictive ability. 


In supervised learning, the input and output data are labeled, the machine learns 
to model the outputs from the inputs, and then the machine is given new data on 
which to use the model. In unsupervised learning, the input data are not labeled, 
and the machine learns to describe the structure of the data. Deep learning is a 
technique that uses layers of neural networks to identify patterns, beginning with 
simple patterns and advancing to more complex ones. Deep learning may employ 
supervised or unsupervised learning. Some of the applications of deep learning 
include image and speech recognition. 


Machine learning can produce models that overfit or underfit the data. Overfitting 
occurs when the machine learns the input and output data too exactly, treats noise 
as true parameters, and identifies spurious patterns and relationships. In effect, the 
machine creates a model that is too complex. Underfitting occurs when the 
machine fails to identify actual patterns and relationships, treating true parameters 
as noise. This means that the model is not complex enough to describe the data. A 
further challenge with machine learning is that its results can be a “black box,’ 
producing outcomes based on relationships that are not readily explainable. 


LOS 11.c: Describe applications of Big Data and Data Science to investment 
management. 


Applications of fintech that are relevant to investment management include text 
analytics, natural language processing, risk governance, and algorithmic trading. 


Text analytics refers to the analysis of unstructured data in text or voice forms. An 
example of text analytics is analyzing the frequency of words and phrases. In the 
finance industry, text analytics have the potential to partially automate specific 
tasks such as evaluating company regulatory filings. 


Natural language processing refers to the use of computers and artificial 
intelligence to interpret human language. Speech recognition and language 
translation are among the uses of natural language processing. Possible applications 
in finance could be to check for regulatory compliance in an examination of 
employee communications, or to evaluate large volumes of research reports to 
detect more subtle changes in sentiment than can be discerned from analysts’ 
recommendations alone. 


Risk governance requires an understanding of a firm’s exposure to a wide variety of 
risks. Financial regulators require firms to perform risk assessments and stress 
testing. The simulations, scenario analysis, and other techniques used for risk 
analysis require large amounts of quantitative data along with a great deal of 
qualitative information. Machine learning and other techniques related to Big Data 
can be useful in modeling and testing risk, particularly if firms use real-time data to 
monitor risk exposures. 


Algorithmic trading refers to computerized securities trading based on a 
predetermined set of rules. For example, algorithms may be designed to enter the 
optimal execution instructions for any given trade based on real-time price and 


volume data. Algorithmic trading can also be useful for executing large orders by 
determining the best way to divide the orders across exchanges. Another application 
of algorithmic trading is high-frequency trading that identifies and takes 
advantage of intraday securities mispricings. 


=) MODULE QUIZ 11.1 
—k 
1. Fintech is most accurately described as the: 
A. application of technology to the financial services industry. 
B. replacement of government-issued money with electronic currencies. 
C. clearing and settling of securities trades through distributed ledger technology. 
2. Which of the following technological developments is most likely to be useful for 
analyzing Big Data? 
A. Machine learning. 
B. High-latency capture. 
C. The Internet of Things. 


KEY CONCEPTS 


LOS l1l.a 

Fintech refers to developments in technology that can be applied to the financial 
services industry. Companies that develop technologies for the finance industry are 
referred to as fintech companies. 


LOS 11.b 

Big Data refers to the potentially useful information that is generated in the 
economy, including data from traditional and nontraditional sources. Characteristics 
of Big Data include its volume, velocity, and variety. 


Artificial intelligence refers to computer systems that can be programmed to 
simulate human cognition. Neural networks are an example of artificial intelligence. 


Machine learning is programming that gives a computer system the ability to 
improve its performance of a task over time and is often used to detect patterns in 
large sets of data. 


LOS 1l1.c 
Applications of fintech to investment management include text analytics, natural 
language processing, risk governance, and algorithmic trading. 


Text analytics refers to analyzing unstructured data in text or voice forms. Natural 
language processing is the use of computers and artificial intelligence to interpret 
human language. Algorithmic trading refers to computerized securities trading 
based on predetermined rules. 


ANSWER KEY FOR MODULE QUIZZES 


Module Quiz 11.1 


1.A_ Fintech is the application of technology to the financial services industry and 
to companies that are involved in developing and applying technology for 


financial services. Cryptocurrencies and distributed ledger technology are 
examples of fintech-related developments. (LOS 11.a) 


2.A Machine learning is a computer programming technique useful for identifying 
and modeling patterns in large volumes of data. The Internet of Things is the 
network of devices that is one of the sources of Big Data. Capture is one aspect 
of processing data. Latency is the lag between when data is generated and when 
it is needed. (LOS 11.b) 


Topic Quiz: Quantitative Methods 


You have now finished the Quantitative Methods topic section. Please log into your 
Schweser online dashboard and take the Topic Quiz on this section. The Topic Quiz 
provides immediate feedback on how effective your study has been for this material. 
Questions are more exam-like than typical Module Quiz or QBank questions; a score of 
less than 70% indicates that your study likely needs improvement. These tests are best 
taken timed; allow 1.5 minutes per question. 
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LOS 12.a: Determine and interpret breakeven and shutdown available online. 
points of production, as well as how economies and 
diseconomies of scale affect costs under perfect and imperfect competition. 


In economics, we define the short run for a firm as the time period over which 
some factors of production are fixed. Typically, we assume that capital is fixed in the 
short run so that a firm cannot change its scale of operations (plant and equipment) 
over the short run. All factors of production (costs) are variable in the long run. The 
firm can let its leases expire and sell its equipment, thereby avoiding costs that are 
fixed in the short run. 


Shutdown and Breakeven Under Perfect 
Competition 

Asa simple example of shutdown and breakeven analysis, consider a retail store 
with a one-year lease (fixed cost) and one employee (quasi-fixed cost), so that 
variable costs are simply the store’s cost of merchandise. If the total sales (total 
revenue) just cover both fixed and variable costs, price equals both average revenue 
and average total cost—so we are at the breakeven output quantity, and economic 
profit equals zero. 


During the period of the lease (the short run), as long as items are being sold for 
more than their variable cost, the store should continue to operate to minimize 
losses. If items are being sold for less than their average variable cost, losses would 
be reduced by shutting down the business in the short run. 


In the long run, a firm should shut down if the price is less than average total cost, 
regardless of the relation between price and average variable cost. 


For a firm under perfect competition (a price-taker), we can use a graph of cost 
functions to examine the profitability of the firm at different output prices. In Figure 
12.1, at price P;, price and average revenue equal average total cost. At the output 
level of Point A, the firm is making an economic profit of zero. At a price above P), 
economic profit is positive, and at prices less than P;, economic profit is negative 
(the firm has economic losses). 


Figure 12.1: Shutdown and Breakeven 
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Because some costs are fixed in the short run, it will be better for the firm to 
continue production in the short run as long as average revenue is greater than 
average variable costs. At prices between P, and P, in Figure 12.1, the firm has 
losses, but the losses are smaller than would occur if all production were stopped. 
As long as total revenue is greater than total variable cost, at least some of the firm’s 
fixed costs are covered by continuing to produce and sell its product. If the firm 
were to shut down, losses would be equal to the fixed costs that still must be paid. 
As long as price is greater than average variable costs, the firm will minimize its 
losses in the short run by continuing in business. 


If average revenue is less than average variable cost, the firm’s losses are greater 
than its fixed costs, and it will minimize its losses by shutting down production in 
the short run. In this case (a price less than P, in Figure 12.1), the loss from 
continuing to operate is greater than the loss (total fixed costs) if the firm is shut 
down. 


In the long run, all costs are variable, so a firm can avoid its (short-run) fixed costs 
by shutting down. For this reason, if price is expected to remain below minimum 
average total cost (Point A in Figure 12.1) in the long run, the firm will shut down 
rather than continue to generate losses. 


To sum up, if average revenue is less than average variable cost in the short run, the 
firm should shut down. This is its short-run shutdown point. If average revenue is 
greater than average variable cost in the short run, the firm should continue to 
operate, even if it has losses. In the long run, the firm should shut down if average 
revenue is less than average total cost. This is the long-run shutdown point. If 
average revenue is just equal to average total cost, total revenue is just equal to total 
(economic) cost, and this is the firm’s breakeven point. 


= If AR 2ATC, the firm should stay in the market in both the short and long run. 


= If AR = AVC, but AR < ATC, the firm should stay in the market in the short run but 
will exit the market in the long run. 


=» If AR < AVC, the firm should shut down in the short run and exit the market in the 
long run. 


Shutdown and Breakeven Under Imperfect 
Competition 


For price-searcher firms (those that face downward-sloping demand curves), we 
could compare average revenue to ATC and AVC, just as we did for price-taker firms, 
to identify shutdown and breakeven points. However, marginal revenue is no longer 
equal to price. 


We can, however, still identify the conditions under which a firm is breaking even, 
should shut down in the short run, and should shut down in the long run in terms of 
total costs and total revenue. These conditions are as follows: 


« TR=TC: break even 


» TC >TR> TVC: firm should continue to operate in the short run but shut down in 
the long run 


=» TR<TVC: firm should shut down in the short run and the long run 


Because price does not equal marginal revenue for a firm in imperfect competition, 
analysis based on total costs and revenues is better suited for examining breakeven 
and shutdown points. 


The previously described relations hold for both price-taker and price-searcher 
firms. We illustrate these relations in Figure 12.2 for a price-taker firm (TR 
increases at a constant rate with quantity). Total cost equals total revenue at the 
breakeven quantities Qpr; and Qpro. The quantity for which economic profit is 


maximized is shown as Q,,,3,- 


Figure 12.2: Breakeven Point Using the Total Revenue/Total Cost Approach 
$ 


TC 
Economic 


losses -—~, 


Economic 
Economic _— Profits 
losses ) 


Quantity 


Q.. 2 


BE2 


If the entire TC curve exceeds TR (i.e., no breakeven point), the firm will want to 
minimize the economic loss in the short run by operating at the quantity 
corresponding to the smallest (negative) value of TR — TC. 


EXAMPLE: Short-run shutdown decision 


For the last fiscal year, Legion Gaming reported total revenue of $700,000, total 
variable costs of $800,000, and total fixed costs of $400,000. Should the firm 
continue to operate in the short run? 


Answer: 


The firm should shut down. Total revenue of $700,000 is less than total costs of 
$1,200,000, and it is also less than total variable costs of $800,000. By shutting 


down, the firm will lose an amount equal to fixed costs of $400,000. This is less 
than the loss of operating, which is TR - TC = $500,000. 


EXAMPLE: Long-run shutdown decision 


Suppose, instead, that Legion Gaming reported total revenue of $850,000. Should 
the firm continue to operate in the short run? Should it continue to operate in the 
long run? 


Answer: 


In the short run, TR > TVC, and the firm should continue operating. The firm 
should consider exiting the market in the long run, as TR is not sufficient to cover 
all of the fixed costs and variable costs. 


Economies and Diseconomies of Scale 


While plant size is fixed in the short run, in the long run, firms can choose their most 
profitable scale of operations. Because the long-run average total cost (LRATC) 
curve is drawn for many different plant sizes or scales of operation, each point along 
the curve represents the minimum ATC for a given plant size or scale of operations. 
In Figure 12.3, we show a firm’s LRATC curve along with short-run average total 
cost (SRATC) curves for many different plant sizes, with SRATC,+, representing a 


larger scale of operations than SRATC,. 


Figure 12.3: Economies and Diseconomies of Scale 
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We draw the LRATC curve as U-shaped. Average total costs first decrease with 
larger scale, but eventually begin to increase with larger scale. The lowest point on 
the LRATC corresponds to the scale or plant size at which the average total cost of 
production is at a minimum. This scale is sometimes called the minimum efficient 
scale. Under perfect competition, firms must operate at minimum efficient scale in 
long-run equilibrium, and LRATC will equal the market price. Recall that under 
perfect competition, firms earn zero economic profit in long-run equilibrium. Firms 
that have chosen a different scale of operations with higher average total costs will 
have economic losses and must either leave the industry or change to the minimum 
efficient scale. 


The downward-sloping segment of the LRATC curve presented in Figure 12.3 
indicates that economies of scale (or increasing returns to scale) are present. 
Economies of scale result from factors such as labor specialization, mass 
production, and investment in more efficient equipment and technology. In addition, 
the firm may be able to negotiate lower input prices with suppliers as it increases in 
size and purchases more resources. A firm operating with economies of scale can 
increase its competitiveness by expanding production and reducing costs. 


The upward-sloping segment of the LRATC curve indicates that diseconomies of 
scale are present. Diseconomies of scale may result as the increasing bureaucracy of 
larger firms leads to inefficiency, problems with motivating a larger workforce, and 
greater barriers to innovation and entrepreneurial activity. A firm operating under 
diseconomies of scale will want to decrease output and move back toward the 
minimum efficient scale. The U.S. auto industry is an example of an industry that has 
exhibited diseconomies of scale. 


There may be a relatively flat portion at the bottom of the LRATC curve that 
exhibits constant returns to scale, or relatively constant costs across a range of plant 
sizes. 


=I MODULE QUIZ 12.1 
—=h 
1. In a purely competitive market, economic losses indicate that: 
A. price is below average total costs. 
B. collusion is occurring in the marketplace. 
C. firms need to expand output to reduce costs. 


2. A firm is likely to operate in the short run as long as price is at least as great as: 
A. marginal cost. 
B. average total cost. 
C. average variable cost. 


3. A firm’s average revenue is greater than its average variable cost and less than its 
average total cost. If this situation is expected to persist, the firm should: 
A. shut down in the short run and in the long run. 
B. shut down in the short run, but operate in the long run. 
C. operate in the short run, but shut down in the long run. 
4. If a firm increases its plant size by 10% and its minimum average total cost 
increases by 10%, the firm is experiencing: 
A. constant returns to scale. 
B. diseconomies of scale. 
C. economies of scale. 


MODULE 12.2: CHARACTERISTICS OF C) 
MARKET STRUCTURES 


Video covering 
this content is 
available online. 
LOS 12.b: Describe characteristics of perfect competition, 


monopolistic competition, oligopoly, and pure monopoly. 


Recall from the prerequisite readings that perfect competition results in firm 
demand that is horizontal (perfectly elastic) at the market price. The firm demand 
curves for the three other market structures we discuss are all downward sloping. 


When a firm’s demand curve slopes downward, marginal revenue (MR) is less than 
price. For both horizontal and downward-sloping demand curves, a firm will 
maximize profits by producing the quantity for which MR is just equal to marginal 
cost. 


While it may not be true in every case, it may be useful to think of firms under pure 
monopoly as having the steepest demand curves. Firms under monopolistic 
competition typically have relatively elastic downward-sloping demand curves, 
while firms in an oligopoly market will face downward-sloping demand curves 
somewhere between these two extremes. 


We can analyze where a market falls along the spectrum from perfect competition 
to pure monopoly by examining five factors: 


1. Number of firms and their relative sizes 

2. Degree to which firms differentiate their products 
3. Bargaining power of firms with respect to pricing 
4. Barriers to entry into or exit from the industry 


5. Degree to which firms compete on factors other than price 


Perfect competition refers to a market in which many firms produce identical 
products, barriers to entry into the market are very low, and firms compete for sales 
only on the basis of price. Firms face perfectly elastic (horizontal) demand curves at 
the price determined in the market because no firm has a large enough portion of 
the overall market to affect the market price of the good. The market for wheat in a 
region is a good approximation of such a market. Overall market supply and demand 
determine the price of wheat, and each producer can sell all that they choose to at 
that price. 


Monopolistic competition differs from perfect competition in that products are 
not identical. Each firm differentiates its product(s) from those of other firms 
through some combination of differences in product quality, product features, and 
marketing. The demand curve faced by each firm is downward sloping (i.e., neither 
perfectly elastic nor perfectly inelastic). Prices that producers charge are not 
identical because of perceived differences among their products, and typically, 
barriers to entry are low. The market for toothpaste is a good example of 
monopolistic competition. Firms differentiate their products through features and 
marketing with claims of more attractiveness, whiter teeth, fresher breath, and even 
actually cleaning your teeth and preventing decay. If the price of your personal 
favorite increases, you are not likely to immediately switch to another brand as we 
assume under perfect competition. Some customers may switch brands in response 
to a 10% increase in price, and some may not. This is why firm demand is downward 
sloping rather than perfectly elastic. 


The most important characteristic of an oligopoly market is that only a few firms 
are in the industry. In such a market, each firm must consider the actions and 
responses of other firms in setting price and business strategy. We say that such 
firms are interdependent. While the firms’ products are typically good substitutes 
for each other, they may be either quite similar or differentiated through features, 
branding, marketing, and quality. Barriers to entry are typically high, often because 


of economies of scale in production or marketing, which accounts for the existence 
of a small number of firms with relatively large market shares. Demand can be more 
or less elastic than for firms in monopolistic competition. The automobile market is 
dominated by a small number of large firms and can be characterized as an 
oligopoly. The product and pricing decisions of Toyota certainly affect those of Ford, 
and vice versa. Automobile makers compete based on price, but they also compete 
through marketing, product features, and quality, which are often signaled strongly 
through the brand name. The oil industry also has a few firms with large market 
shares, but their products are, in most cases, good substitutes for each other. 


A monopoly market is characterized by a single seller of a product with no good 
substitutes. This fact alone means that the firm faces a downward-sloping demand 
curve (the market demand curve) and has the power to choose the price at which it 
sells its product. High barriers to entry protect a monopoly producer from 
competition. One source of monopoly power is the protection offered by copyrights 
and patents. Another possible source of monopoly power is control over a resource 
specifically needed to produce the product. Most frequently, monopoly power is 
supported by specific laws or government regulation (e.g., a local electric utility). 


Figure 12.4 shows the key features of each market structure. 


Figure 12.4: Characteristics of Market Structures 
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significant 


LOS 12.c: Explain supply and demand relationships under monopolistic 
competition, including the optimal price and output for firms as well as 
pricing strategy. 


Monopolistic competition has the following market characteristics: 


« A large number of independent sellers. (1) Each firm has a relatively small market 
share, so no individual firm has any significant power over price. (2) Firms only 
need to pay attention to average market price, not the prices of individual 
competitors. (3) There are too many firms in the industry for collusion (price- 
fixing) to be possible. 

« Differentiated products. Each producer has a product that is, in some way, different 


from those of its competitors (in the minds of consumers). The competing 
products are considered close substitutes for one another. 


« Firms compete less on price and more on marketing, perceived quality, and 
differences in features. Firms must make price and output decisions because they 
face downward-sloping demand curves. 


» Low barriers to entry. The cost of entering the market and exiting the market are 
relatively low. 


Think about the market for toothpaste. Brands of toothpaste are quite similar, and it 
is reasonable to assume that toothpaste is not too difficult or costly to produce. But 
brands are differentiated based on specific features, on influential advertising and 
marketing, and on the reputations of the producers. 


The price/output decision for monopolistic competition is illustrated in Figure 12.5. 
Panel A of Figure 12.5 illustrates the short-run price/output characteristics of 
monopolistic competition for a single firm. As indicated, firms in monopolistic 
competition maximize economic profits by producing where marginal revenue (MR) 
equals marginal cost (MC), and by charging the price for that quantity from the 
demand curve, D. Here, the firm earns positive economic profits because price, P, 
exceeds average total cost, ATC’. Due to low barriers to entry, competitors can enter 
the market in pursuit of these economic profits. 


Figure 12.5: Short-Run and Long-Run Output Under Monopolistic Competition 
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Panel B of Figure 12.5 illustrates long-run equilibrium for a representative firm after 
new firms have entered the market. As indicated, the entry of new firms shifts the 
demand curve faced by each individual firm down to the point where price equals 
average total cost (P’ = ATC ), such that economic profit is zero. At this point, there 
is no longer an incentive for new firms to enter the market, and long-run market 
equilibrium is established. A firm in monopolistic competition continues to produce 
at the quantity where MR = MC, but it no longer earns positive economic profits. We 
can get to a similar long-run equilibrium even without entry of new firms if each 
firm increases its marketing spending (a component of ATC) to either increase or 
defend its market share until ATC has increased to that shown in Panel B. If all firms 
compete in this way, each firm will produce Q’ and sell at P’ but earn no economic 
profit because marketing costs have increased ATC to ATC’, which is equal to price. 
Advertising expenses are often relatively high for firms in monopolistic competition. 


Figure 12.6 illustrates the differences between long-run equilibrium in markets with 
monopolistic competition and markets with perfect competition. Note that with 
monopolistic competition, price is greater than MC (i-e., producers can realize an 


economic profit), average total cost is not at a minimum for the quantity produced 
(suggesting excess capacity, or an inefficient scale of production), and the price is 
slightly higher than under perfect competition. The point to consider here, however, 
is that perfect competition is characterized by no product differentiation. The 
question of the efficiency of monopolistic competition becomes, “Is there an 
economically efficient amount of product differentiation?” 


Figure 12.6: Firm Output Under Monopolistic and Perfect Competition 
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In a world with only one brand of toothpaste, clearly, average production costs 
would be lower. That fact alone probably does not mean that a world with only one 
brand or type of toothpaste would be a better world. While product differentiation 
has costs, it also has benefits to consumers. Consider the market for a 
pharmaceutical that reduces blood pressure to prolong life. There may be several 
competing drugs that are more or less effective for, or well or poorly tolerated by, 
different groups of patients. In this case, we may find that firm demand curves are 
relatively steep compared to those for brands of toothpaste, indicating that the 
competing drugs are not considered good substitutes for many patients. 


LOS 12.d: Explain supply and demand relationships under oligopoly, including 
the optimal price and output for firms as well as pricing strategy. 


Compared to monopolistic competition, an oligopoly market has higher barriers to 
entry and fewer firms. The other key difference is that the firms are interdependent; 
a price change by one firm can be expected to be met by a price change by its 
competitors in response. This means that the actions of another firm will directly 
affect a given firm’s demand curve for the product. Given this complicating fact, 
models of oligopoly pricing and profits must make numerous important 
assumptions. In the following, we describe four of these models and their 
implications for price and quantity: 


1. Kinked demand curve model 
2. Cournot duopoly model 
3. Nash equilibrium model 


4. Stackelberg dominant firm model 


One traditional model of oligopoly, the kinked demand curve model, is based on 
the assumption that competitors are unlikely to match a price increase by a 


competitor, but very likely to match a price decrease by a competitor. This results in 
a kink in the demand curves faced by each producer, at the current market price. 
Each firm believes that it faces a demand curve that is more elastic (flatter) above 
the current price (the kink in the demand curve) than it is below the given price. The 
kinked demand curve model is illustrated in Figure 12.7. The kink price is at price 
Px, where a firm produces Qx. A firm believes that if it raises its price above Py, its 
competitors will remain at Py, and it will lose market share because it has the 
highest price. Above P,, the demand curve is considered to be relatively elastic (i.e., 
a small price increase will result in a large decrease in demand). On the other hand, 
if a firm decreases its price below Px, other firms will match the price cut, and all 
firms will experience a relatively small increase in sales relative to any price 
reduction. Therefore, Q, is the profit-maximizing level of output. 


Figure 12.7: Kinked Demand Curve Model 
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With a kink in the demand curve, we also get a gap in the associated MR curve, as 
shown in Figure 12.8. For any firm with a MC curve passing through this gap, the 
price where the kink is located is the firm’s profit-maximizing price. 


Figure 12.8: Marginal Revenue With Kinked Demand Curve 
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We say that the decisions of firms in an oligopoly are interdependent; that is, the 
pricing decision of one firm depends on the pricing decisions of other firms. Some 
models of market price equilibrium have a set of rules for the actions of oligopolists. 
These rules assume they choose prices based on the choices of the other firms. By 
specifying the decision rules that each firm follows, we can design a model that 
allows us to determine the equilibrium prices and quantities for firms operating in 
an oligopoly market. 


An early model of oligopoly pricing decisions is the Cournot model. In Cournot’s 
duopoly model, two firms with identical MC curves each choose their preferred 
selling price based on the price the other firm chose in the previous period. Firms 
assume that the competitor’s price will not change. The long-run equilibrium for an 
oligopoly with two firms (duopoly), in the Cournot model, is for both firms to sell 
the same quantity, dividing the market equally at the equilibrium price. The 
equilibrium price is less than the price that a single monopolist would charge, but 
greater than the equilibrium price that would result under perfect competition. 
With a greater number of producers, the long-run market equilibrium price moves 
toward the competitive price. 


Another model, the Stackelberg model, uses a different set of rules and produces a 
different result. While the Cournot model assumes the competitors choose price 
simultaneously each period, the Stackelberg model assumes pricing decisions are 
made sequentially. One firm, the “leader,” chooses its price first, and the other firm 
chooses a price based on the leader’s price. In long-run equilibrium, under these 
rules, the leader charges a higher price and receives a greater proportion of the 
firms’ total profits. 


These models are early versions of rules-based models, which fall under the heading 
of what are now generally termed strategic games. Strategic games comprise 
decision models in which the best choice for a firm depends on the expected actions 
(reactions) of other firms. 


A more general model of strategic games was developed by Nobel Prize winner John 
Nash, who developed the concept of a Nash equilibrium. A Nash equilibrium is 
reached when the choices of all firms are such that there is no other choice that 
makes any firm better off (increases profits or decreases losses). The Cournot model 
results in a Nash equilibrium. In equilibrium, neither competitor can increase 
profits by changing the price they charge. 


The concept of a Nash equilibrium can be illustrated with the situation presented in 
Figure 12.9, which shows the choices and resulting profits for two firms. Each firm 
can charge either a high price or a low price. If both firms charge a high price, Firm 
A earns 1,000 and Firm B earns 600. While Firm A would not charge the low price (it 
would earn less regardless of Firm B’s decision), Firm B can increase profits to 700 
by charging a low price. With Firm A charging a high price and Firm B charging a 
low price, neither firm can increase profits by unilaterally changing its price 
strategy. Thus, we can identify the Nash equilibrium in this scenario as Firm B 
charging a low price and Firm A charging a high price. 


Figure 12.9: Nash Equilibrium 


Firm B high price Firm B low price 


Firm A high price A earns 1,000 A earns 600 
B earns 600 B earns 700 

Firm A low price A earns 160 A earns 100 
B earns 0 B earns 140 


Thus far, we have assumed that firms act competitively to maximize their individual 
profits. We can illustrate the potential of cooperative behavior among producers to 
increase the total profits of all firms in the market. Collusion refers to competitors 
making a joint agreement to charge a given price—or, alternatively, to agree to 
specific levels of output. In Figure 12.9, the greatest joint profits (1,600) are earned 
when both firms charge a high price. If Firm A offers to pay Firm B 200 for charging 
a high price, Firm A’s profits increase from 600 to 1,000. After paying 200 to Firm B, 
Firm A still gains 200. Firm B’s profits (including the payment of 200) increase from 
700 to 800. Collusion, in this case, increases the profits of both firms, compared to 
the Nash equilibrium. If firms can enter into and enforce an agreement regarding 
pricing and output, often they can all benefit. Such agreements among producers are 
illegal in many countries because they reduce competition. 


An example of a collusive agreement is the OPEC cartel. Cartel-member countries 
agree to restrict their oil production to increase the world price of oil. Members 
sometimes choose to “cheat” on the cartel agreement by producing more than the 
amount of oil they have agreed to produce. If members of a cartel do not adhere to 
the agreement (taking advantage of the higher market price but failing to restrict 
output to the agreed-upon amount), the agreement can quickly break down. 


In general, collusive agreements to increase price in an oligopoly market will be 
more successful (have less cheating) under the following conditions: 

= There are fewer firms. 

» Products are more similar (less differentiated). 

= Cost structures are more similar. 

= Purchases are relatively small and frequent. 

= Retaliation by other firms for cheating is more certain and more severe. 

= There is less actual or potential competition from firms outside the cartel. 

A final model of oligopoly behavior to consider is the dominant firm model. In this 
model, a single firm has a significantly large market share because of its greater 
scale and lower cost structure—the dominant firm (DF). In such a model, the market 


price is essentially determined by the DF, and the other competitive firms (CFs) take 
this market price as given. 


The DF believes that the quantity supplied by the other firms decreases at lower 
prices, so that the DF’s demand curve is related to the market demand curve, as 
shown in Figure 12.10. Based on this demand curve (Dp,) and its associated 
marginal revenue (MRp,) curve, the firm will maximize profits at a price of P’. The 
CFs maximize profits by producing the quantity for which their marginal cost 
(MCcr) equals P’, quantity Qcr. 


Figure 12.10: Dominant Firm Oligopoly 
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A price decrease by one of the CFs, which increases Qc, in the short run, will lead to 
a decrease in price by the DF, and CFs will decrease output or exit the industry in the 
long run. The long-run result of such a price decrease by competitors below P” 
would then be to decrease the overall market share of competitor firms and increase 
the market share of the DF. 


Clearly, oligopoly markets exhibit many possible outcomes that depend on the 
characteristics of the firms and of the market itself. The important point is that the 
firms’ decisions are interdependent so that the expected reactions of other firms are 
an important consideration. Overall, the resulting price will be somewhere between 
the price based on perfect collusion that would maximize total profits to all firms in 
the market (which is actually the monopoly price), and the price that would result 
from perfect competition and generate zero economic profits in the long run. These 
two limiting outcomes are illustrated in Figure 12.11 as Poonusion With Qoolusion fOr 


perfect collusion, and Poompetition 29d Qcompetition for perfect competition. 


Figure 12.11: Collusion vs. Perfect Competition 
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1. The demand for products from monopolistic competitors is relatively elastic due to: 
A. high barriers to entry. 
B. the availability of many close substitutes. 
C. the availability of many complementary goods. 


2. Compared to a perfectly competitive industry, in an industry characterized by 
monopolistic competition: 
A. both price and quantity are likely to be lower. 
B. price is likely to be higher, and quantity is likely to be lower. 
C. quantity is likely to be higher, and price is likely to be lower. 


3. A firm will most likely maximize profits at the quantity of output for which: 
A. price equals marginal cost. 
B. price equals marginal revenue. 
C. marginal cost equals marginal revenue. 


4. An oligopolistic industry has: 
A. few barriers to entry. 
B. few economies of scale. 
C. a great deal of interdependence among firms. 


5. Consider a firm in an oligopoly market that believes the demand curve for its 
product is more elastic above a certain price than below this price. This belief fits 
most appropriately to which of the following models? 

A. Cournot model. 
B. Dominant firm model. 
C. Kinked demand model. 


6. Consider an agreement between France and Germany that will restrict wine 
production so that maximum economic profit can be realized. The possible 
outcomes of the agreement are presented in the following table. 


Germany complies Germany defaults 
France complies France gets €8 billion France gets €2 billion 
Germany gets €8 billion Germany gets €10 billion 
France defaults = France gets €10 billion France gets €4 billion 
Germany gets €2 billion Germany gets €4 billion 


Based on the concept of a Nash equilibrium, the most likely strategy followed 
by the two countries with respect to whether they comply with or default on 
the agreement will be: 


A. both countries will default. 
B. both countries will comply. 
C. one country will default and the other will comply. 


MODULE 12.3: IDENTIFYING MARKET Cc) 
STRUCTURES 


Video covering 
this content is 
available online. 


LOS 12.e: Identify the type of market structure within which a 
firm operates and describe the use and limitations of 
concentration measures. 


We can use the characteristics we outlined earlier to identify the type of market 
structure within which a firm is operating. Our earlier table is repeated here in 
Figure 12.12. For an analyst attempting to determine the degree of pricing power 
that firms in the industry have, the focus is on the number of firms in the industry, 
its barriers to entry, the nature of substitute products, and the nature of industry 
competition. Significant interdependence among firm pricing and output decisions is 
a characteristic of all oligopoly markets, although some interdependence may be 
present under monopolistic competition—even with many more firms than for an 
oligopoly structure. 


The following table illustrates the differences in characteristics among the various 
market structures. 


Figure 12.12: Characteristics of Market Structures 


Perfect Monopolistic 

Competition Competition Oligopoly Monopoly 
Number of Many firms Many firms Few firms Single firm 
sellers 
Barriers to Very low Low High Very high 
entry 
Nature of Very good Good Good substitutes No good 
substitute substitutes substitutes, but or differentiated § substitutes 
products differentiated 
Nature of Price only Price, marketing, Price, marketing, Advertising 
competition features features 
Pricing power None Some Some to Significant 


significant 


Given the differences in cost structures, suitability of substitutes, and the degree of 
differentiation, simply classifying the primary characteristics of a market does not 
tell us the degree of pricing power for individual firms or the magnitude of the 
difference between market prices and the prices implied by perfect competition. 
What we would really like (especially for the regulation of markets) is to be able to 
measure the elasticity of firm demand directly, but that is difficult and subject to 
estimation error. 


Consequently, regulators often use market shares (percentages of market sales) to 
measure the degree of monopoly or market power of firms in an industry. Often, 
mergers or acquisitions of companies in the same industry or market are not 
permitted by government authorities when they determine that the market share of 
the combined firms will be too high and, therefore, detrimental to the economy. 


Market or industry concentration measures are often used as an indicator of 
market power. One concentration measure is the N-firm concentration ratio, 
which is calculated as the sum of the percentage market shares of the largest N firms 
ina market. While this measure is simple to calculate and understand, it does not 
directly measure market power or elasticity of demand. 


One limitation of the N-firm concentration ratio is that it may be relatively 
insensitive to mergers of firms within an industry. This problem is reduced by using 
an alternative measure of market concentration, the Herfindahl-Hirschman Index 
(HHI). The HHI is calculated as the sum of the squares of the market shares of the 


largest firms in the market. The following example illustrates this difference 
between the two measures and their calculation. 


EXAMPLE: 4-firm concentration ratio and 4-firm HHI 


Given the market shares of the following firms, calculate the 4-firm concentration 
ratio and the 4-firm HHI, both before and after a merger of Acme and Blake. 


Firm Market Share 


Acme 25% 

Blake 15% 

Curtis 15% 

Dent 10% 

Erie 5% 

Federal 5% 
Answer: 


Before the merger, the 4-firm concentration ratio for the market is 25+ 15+15+ 
10 = 65%. After the merger, the Acme + Blake firm has 40% of the market, and the 
4-firm concentration ratio is 40 + 15 + 10 + 5 = 70%. Although the 4-firm 
concentration ratio has only increased slightly, the market power of the largest 
firm in the industry has increased significantly, from 25% to 40%. 


Before the merger, the 4-firm HHI is 0.252 + 0.152 + 0.15 + 0.102 = 0.1175. 


After the merger, the 4-firm HHI is 0.402 + 0.152 + 0.107 + 0.052 = 0.1950, a 
significant increase. 


A limitation that applies to both of our simple concentration measures is that 
barriers to entry are not considered in either case. Even a firm with high market 
share may not have much pricing power if barriers to entry are low and there is 
potential competition. With low barriers to entry, it may be the case that other firms 
stand ready to enter the market if firms currently in the market attempt to increase 
prices significantly. In this case, the elasticity of demand for existing firms may be 
high, even though they have relatively high market shares and industry 
concentration measures. 


=) MODULE QUIZ 12.3 
1. Which of the following is most likely an advantage of the Herfindahl-Hirschman 
Index (HHD) relative to the N-firm concentration ratio? 
A. The HHI is simpler to calculate. 
B. The HHI considers barriers to entry. 


C. The HHI is more sensitive to mergers. 


2. A market characterized by low barriers to entry, good substitutes, limited pricing 
power, and marketing of product features is best characterized as: 
A. oligopoly. 
B. perfect competition. 
C. monopolistic competition. 


KEY CONCEPTS 


LOS 12.a 
The breakeven quantity of production is the quantity for which price (P) = average 
total cost (ATC), and total revenue (TR) = total cost (TC). 


A firm should shut down in the long run if P < ATC so that TR < TC. A firm should 
shut down in the short run (and the long run) if P < average variable cost (AVC) so 
that TR < total variable cost (TVC). 


The long-run average total cost (LRATC) curve shows the minimum average total 
cost for each level of output, assuming that the plant size (scale of the firm) can be 
adjusted. A downward-sloping segment of an LRATC curve indicates economies of 
scale (increasing returns to scale). Over such a segment, increasing the scale of the 
firm reduces ATC. An upward-sloping segment of an LRATC curve indicates 
diseconomies of scale, where average unit costs will rise as the scale of the business 
(and long-run output) increases. 

LOS 12.b 


Perfect competition is characterized by the following: 


» Many firms, each small relative to the market 

= Very low barriers to entry into or exit from the industry 

=» Homogeneous products that are perfect substitutes; no advertising or branding 
» No pricing power 

Monopoly is characterized by the following: 


« A single firm that comprises the whole market 

« Very high barriers to entry into or exit from the industry 
» Advertising used to compete with substitute products 

» Significant pricing power 

Monopolistic competition is characterized by the following: 


» Many firms 

= Low barriers to entry into or exit from the industry 

« Differentiated products; heavy advertising and marketing expenditure 
=» Some pricing power 


Oligopoly markets are characterized by the following: 


= Few sellers 
» High barriers to entry into or exit from the industry 
» Products that may be homogeneous or differentiated by branding and advertising 


» Firms that may have significant pricing power 


LOS 12.¢ 


Under monopolistic competition, firms face downward-sloping demand curves so 
that marginal revenue is less than price, and the price from the demand curve at the 
profit-maximizing quantity is the price at the optimal (profit-maximizing) level of 
output. Resources expended on product differentiation may increase ATC so that 


there are no economic profits at long-run equilibrium. There is no well-defined firm 
supply curve. 


Monopolistic competition: 


» Price > marginal revenue = marginal cost (in equilibrium) 


» Zero economic profit in long-run equilibrium 


LOS 12.d 

Under oligopoly, the pricing strategy is not clear. Because firm decisions are 
interdependent, the optimal pricing and output strategy depends on assumptions 
made about other firms’ cost structures and about competitors’ responses to a firm’s 
price changes. 


Models of oligopoly pricing include the following: 


= Cournot. Firms with a given MC that make pricing decisions simultaneously share 
the market equally in LR equilibrium. 

« Stackelberg. When firm pricing decisions are sequential, the first firm to set its 
price (price leader) will maintain a larger share of the market, even in the long 
run. 


= Dominant firm. A firm with a significantly lower cost of production will gain a 
disproportionate share of the market and essentially set the price that other 
competitors can charge. 


In general, in the absence of collusion, the long-run oligopoly equilibrium is Nash 
equilibrium, one in which no competitor can unilaterally change price to increase its 
profits. There is no well-defined firm supply curve. 


Oligopoly: 


» Price > marginal revenue = marginal cost (in equilibrium) 


« May have positive economic profit in long-run equilibrium, but may move toward 
zero economic profit over time 


LOS 12.e 


To identify the market structure in which a firm is operating, we can examine the 
number of firms in its industry, the prevalence of nonprice competition, and barriers 
to entry, then compare these to the characteristics that define each market 
structure. However, none of this provides us with the elasticity of firm demand. 


Because direct estimation of the elasticity of firm demand is statistically difficult 
and imprecise, industry concentration measures are often used as a proxy for the 
degree of competition. A concentration ratio for N firms is calculated as the 
percentage of market sales accounted for by the N largest firms in the industry and 
is used as a simple measure of market structure and market power. 


The Herfindahl-Hirschman Index measure of concentration is calculated as the sum 
of the squared market shares of the largest N firms in an industry and better reflects 
the effect of mergers on industry concentration. 


Neither measure indicates market power directly. Both can be misleading when 
potential competition restricts pricing power. 


ANSWER KEY FOR MODULE QUIZZES 


Module Quiz 12.1 


1.A Ina purely competitive market, economic losses indicate that firms are 
overproducing, causing prices to fall below average total costs. This can occur 
in the short run. In the long run, however, market supply will decrease as firms 
exit the industry, and prices will rise to the point where economic profits are 
zero. (LOS 12.a) 


2.C If price is greater than average variable cost, a firm will continue to operate in 
the short run because it is covering at least some of its fixed costs. (LOS 12.a) 


3.C Ifa firm is generating sufficient revenue to cover its variable costs and part of 
its fixed costs, it should continue to operate in the short run. If average revenue 
is likely to remain below average total costs in the long run, the firm should 
shut down. (LOS 12.a) 


4.B If minimum average total costs increase as plant size is increased, the firm is 
experiencing diseconomies of scale. (LOS 12.a) 


Module Quiz 12.2 


1.B The demand for products from firms competing in monopolistic competition is 
relatively elastic due to the availability of many close substitutes. If a firm 
increases its product price, it will lose customers to firms selling substitute 
products at lower prices. (LOS 12.c) 


2.B Monopolistic competition is likely to result in a higher price and lower 
quantity of output compared to perfect competition. (LOS 12.c) 


3.C The profit-maximizing output is the quantity at which marginal revenue equals 
marginal cost. In a price-searcher industry structure (i.e. any structure that is 
not perfect competition), price is greater than marginal revenue. (LOS 12.c) 


4.C An oligopolistic industry has a great deal of interdependence among firms. One 
firm’s pricing decisions or advertising activities will affect the other firms. 
(LOS 12.d) 


5.C The kinked demand model assumes that each firm in a market believes that at 
some price, demand is more elastic for a price increase than for a price 
decrease. (LOS 12.d) 


6.A The Nash equilibrium results when each nation pursues the strategy that is 
best, given the strategy that is pursued by the other nation. 


» Given that Germany complies with the agreement: France will get €8 billion 
if it complies, but €10 billion if it defaults. Therefore, France should default. 

» Given that Germany defaults: France will get €2 billion if it complies, but €4 
billion if it defaults. Therefore, France should default. 


Because France is better off in either case by defaulting, France will default. 
Germany will follow the same logic and reach the same conclusion. (LOS 12.d) 


Module Quiz 12.3 


1.C Although the N-firm concentration ratio is simple to calculate, it can be 


relatively insensitive to mergers between companies with large market shares 


Neither the HHI nor the N-firm concentration ratio consider barriers to entry. 
(LOS 12.e) 


2.C These characteristics are associated with a market structure of monopolistic 


competition. Firms in perfect competition do not compete on product features 
Oligopolistic markets have high barriers to entry. (LOS 12.e) 


READING 13 


UNDERSTANDING BUSINESS 
CYCLES 


MODULE 13.1: BUSINESS CYCLES a 


Video covering 
this content is 
available online. 


LOS 13.a: Describe the business cycle and its phases. 


The business cycle is characterized by fluctuations in economic 

activity. The business cycle has four phases: expansion (real GDP is increasing), 
peak (real GDP stops increasing and begins decreasing), contraction or recession 
(real GDP is decreasing), and trough (real GDP stops decreasing and begins 
increasing). The phases are illustrated in Figure 13.1. 


Figure 13.1: Business Cycle 
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There are alternative ways to show business cycles. The curve in the figure 
illustrates the classical cycle, which is based on real GDP relative to a beginning 
value. The growth cycle refers to changes in the percentage difference between real 
GDP and its longer-term trend or potential value (shown as the “average” line in the 
figure). The growth rate cycle refers to changes in the annualized percentage 
growth rate from one period to the next and tends to show both peaks and troughs 
earlier than the other two measures. The growth rate cycle—which, like the growth 
cycle, shows GDP relative to a trend rate—is the preferred measure for economists 
and practitioners. 


An expansion features growth in most sectors of the economy, with increasing 
employment, consumer spending, and business investment. As an expansion 


approaches its peak, the rates of increase in spending, investment, and employment 
slow but remain positive, while inflation accelerates. 


A contraction or recession is associated with declines in most sectors, with inflation 
typically decreasing. When a contraction reaches a trough and the economy begins a 
new expansion or recovery, economic growth becomes positive again and inflation 
is typically moderate, but employment growth might not start to increase until the 
expansion has taken hold convincingly. 


Economists commonly consider two consecutive quarters of growth in real GDP to 
be the beginning of an expansion, and two consecutive quarters of declining real GDP 
to be the beginning of a contraction. Statistical agencies that date expansions and 
recessions, such as the National Bureau of Economic Research in the United States, 
look at a wider variety of additional economic data—especially unemployment, 
industrial production, and inflation—to identify turning points in the business cycle. 


A key aspect of business cycles is that they recur, but not at regular intervals. Past 
business cycles have been as short as a year, or longer than a decade. The idea of a 
business cycle applies to economies that consist mainly of businesses. For 
economies that are mostly subsistence agriculture or dominated by state planning, 
fluctuations in activity are not really “business cycles” in the sense we are 
discussing here. 


LOS 13.b: Describe credit cycles. 


Credit cycles refer to cyclical fluctuations in interest rates and the availability of 
loans (credit). Typically, lenders are more willing to lend, and tend to offer lower 
interest rates, during economic expansions. Conversely, they are less willing to lend, 
and require higher interest rates, when the economy is slowing (contracting). 


Credit cycles may amplify business cycles. Widely available or “loose” credit 
conditions during expansions can lead to “bubbles” (prices based on implausible 
expectations) in the markets for some assets, such as subprime mortgages in the 
period leading up to the financial crisis of 2007-2009. Some research suggests that 
expansions tend to be stronger, and contractions deeper and longer lasting, when 
they coincide with credit cycles. They do not always coincide, however, as historical 
data suggest credit cycles have been longer in duration than business cycles on 
average. 


LOS 13.c: Describe how resource use, consumer and business activity, housing 
sector activity, and external trade sector activity vary over the business cycle 
and describe their measurement using economic indicators. 


Business Cycles and Resource Use Fluctuation 


Inventories are an important business cycle indicator. Firms try to keep enough 
inventory on hand to meet sales demand but do not want to keep too much of their 
capital tied up in inventory. As a result, the inventory-sales ratio in many 
industries tends toward a normal level in times of steady economic growth. 


When an expansion is approaching its peak, sales growth begins to slow, and unsold 
inventories accumulate. This can be seen in an increase in the inventory-sales ratio 
above its normal level. Firms respond to this unplanned increase in inventory by 
reducing production, which is one of the causes of the subsequent contraction in the 
economy. An increase in inventories is counted in the GDP statistics as economic 
output whether the increase is planned or unplanned. An analyst who looks only at 
GDP growth, rather than the inventory-sales ratio, might see economic strength 
rather than the beginning of weakness. 


The opposite occurs when a contraction reaches its trough. Having reduced their 
production levels to adjust for lower sales demand, firms find their inventories 
becoming depleted more quickly once sales growth begins to accelerate. This causes 
the inventory-sales ratio to decrease below its normal level. To meet the increase in 
demand, firms will increase output, and the inventory-sales ratio will increase back 
toward normal levels. 


One of the ways that firms react to fluctuations in business activity is by adjusting 
their use of labor and physical capital. Adding and subtracting workers in lockstep 
with changes in economic growth would be costly for firms, in terms of both direct 
expenses and the damage it would do to employee morale and loyalty. Instead, firms 
typically begin by changing how they use their current workers, producing less or 
more output per hour or adjusting the hours they work by adding or removing 
overtime. Only when an expansion or contraction appears likely to persist will they 
hire or lay off workers. 


Similarly, because it is costly to adjust production levels by frequently buying and 
selling plant and equipment, firms first adjust their production levels by using their 
existing physical capital more or less intensively. As an expansion persists, firms will 
increase their production capacity by investing more in plant and equipment. During 
contractions, however, firms will not necessarily sell plant and equipment outright. 
They can reduce their physical capacity by spending less on maintenance or by 
delaying the replacement of equipment that is nearing the end of its useful life. 


Consumer Sector Activity 


Consumer spending, the largest component of gross domestic product, depends on 
the level of consumers’ current incomes and their expectations about their future 
incomes. As a result, consumer spending increases during expansions and decreases 
during contractions. 


Consumer spending in some sectors is more sensitive to business cycle phases than 
spending in other sectors. Spending on durable goods is highly cyclical because 
they are often higher-value purchases. Consumers are more willing to purchase 
high-value durable goods (e.g., appliances, furniture, automobiles) during expansions, 
when incomes are increasing and economic confidence is high. During contractions 
(and sometimes extending into the early stages of expansions), consumers often 
postpone durable goods purchases until they are more confident about their 
employment status and prospects for income growth. 


Consumer spending on services is also positively correlated with business cycle 
phases, but not to the same extent as durable goods spending. Services include some 
discretionary spending, such as for travel, lodging, and restaurant meals, but they 


also include spending that is less discretionary, such as for telecommunications, 
health care, and insurance. Spending on nondurable goods, such as food at home or 
household products for everyday use, remains relatively stable over the business 
cycle. 


Housing Sector Activity 


Although the housing sector is a smaller part of the economy relative to overall 
consumer spending, cyclical swings in activity in the housing market can be large, so 
that the effect on overall economic activity is greater than it otherwise would be. 
Important determinants of the level of economic activity in the housing sector are 
as follows: 


1. Mortgage rates. Low interest rates tend to increase home buying and construction, 
while high interest rates tend to reduce home buying and construction. 


2. Housing costs relative to income. When incomes are cyclically high (low) relative 
to home costs, including mortgage financing costs, home buying and construction 
tend to increase (decrease). Housing activity can decrease even when incomes are 
rising if home prices are rising faster than incomes, as often occurs late in 
expansions. 


3. Speculative activity. As we saw in the housing sector in 2007-08 in many 
economies, rising home prices can lead to purchases based on expectations of 
further gains. Higher prices led to more construction and eventually excess 
building. This resulted in falling prices that decreased or eliminated speculative 
demand and led to dramatic decreases in house prices and in housing activity 
overall. 


4. Demographic factors. The proportion of the population in the 25- to 40-year-old 
segment is positively related to activity in the housing sector because these are 
the ages of greatest household formation. Strong population shifts from rural areas 
to cities, as may occur in newly industrializing economies, may require large 
increases in construction of new housing to accommodate those needs. 


External Trade Sector Activity 


The most important factors determining the level of a country’s imports and exports 
are domestic GDP growth, GDP growth of trading partners, and currency exchange 
rates. Increasing growth of domestic GDP leads to increases in purchases of foreign 
goods (imports), while decreasing domestic GDP growth reduces imports. Exports 
depend on the growth rates of GDP of other economies (especially those of 
important trading partners). Increasing foreign incomes increase sales to foreigners 
(exports), and decreasing economic growth in foreign countries decreases domestic 
exports. 


An increase in the value of a country’s currency makes its goods more expensive to 
foreign buyers and foreign goods less expensive to domestic buyers, which tends to 
decrease exports and increase imports. A decrease in the value of a country’s 
currency has the opposite effect (increasing exports and decreasing imports). 
Currencies affect import and export volumes over time in response to persistent 
trends in foreign exchange rates, rather than in response to short-term changes, 
which can be quite volatile. 


Currency effects on imports and exports can differ in direction from GDP growth 
effects and respond to a complex set of variables. The effects of changes in GDP 
levels and growth rates are more direct and immediate. 


Typical business cycle characteristics may be summarized as follows: 
« Trough: 
- The GDP growth rate changes from negative to positive. 


- There is a high unemployment rate, and an increasing use of overtime and 
temporary workers. 


-— Spending on consumer durable goods and housing may increase. 
- There is a moderate or decreasing inflation rate. 
= Expansion: 
- The GDP growth rate increases. 
- The unemployment rate decreases as hiring accelerates. 


- There are investment increases in producers’ equipment and home 
construction. 


- The inflation rate may increase. 
— Imports increase as domestic income growth accelerates. 
« Peak: 
- The GDP growth rate decreases. 
- The unemployment rate decreases, but hiring slows. 
— Consumer spending and business investments grow at slower rates. 


The inflation rate increases. 


= Contraction/recession: 
- The GDP growth rate is negative. 


Hours worked decrease; unemployment rate increases. 
— Consumer spending, home construction, and business investments decrease. 


- The inflation rate decreases with a lag. 


Imports decrease as domestic income growth slows. 


Economic Indicators 


Economic indicators can be classified into three categories: leading indicators that 
have been known to change direction before peaks or troughs in the business cycle, 
coincident indicators that change direction at roughly the same time as peaks or 
troughs, and lagging indicators that tend not to change direction until after 
expansions or contractions are already underway. 


—) MODULE QUIZ 13.1 
—=h 
1. In the early part of an economic expansion, inventory-sales ratios are most likely 
to: 
A. increase because sales are unexpectedly low. 
B. increase because businesses plan for expansion. 
C. decrease because of unexpected increases in sales. 


2. The contraction phase of the business cycle is least likely accompanied by 
decreasing: 
A. unemployment. 
B. inflation pressure. 
C. economic output. 
3. Which economic sector would most likely correlate strongly and positively with 
credit cycles? 
A. Exports. 
B. Food retail. 
C. Construction. 


4. An economic indicator that has turning points that tend to occur after the turning 
points in the business cycle is classified as a: 
A. lagging indicator. 
B. leading indicator. 
C. trailing indicator. 


5. When they recognize the beginning of a recession, companies are most likely to 
adjust their stock of physical capital by: 
A. selling physical assets. 
B. deferring maintenance of equipment. 
C. canceling orders for new construction equipment. 


KEY CONCEPTS 


LOS 13.a 
The business cycle has four phases: 


1. Expansion. Real GDP is increasing (or GDP growth relative to trend is increasing). 


2. Peak. Real GDP stops increasing/begins decreasing (GDP growth relative to trend 
peaks). 

3. Contraction. Real GDP is decreasing (or GDP growth relative to trend is 
decreasing). 

4. Trough. Real GDP stops decreasing/begins increasing (GDP growth relative to 
trend reaches a low). 


Expansions feature increasing output, employment, consumption, investment, and 
inflation. Contractions are characterized by decreases in these indicators. 


Business cycles are recurring, but they do not occur at regular intervals, can differ in 
strength or severity, and do not persist for specific lengths of time. 


LOS 13.b 


Credit cycles are cyclical fluctuations in interest rates and credit availability. Credit 
cycles may amplify business cycles and cause bubbles in the markets for some 
assets. 


LOS 13.ce 


Inventory to sales ratios typically increase late in expansions when sales slow, and 
decrease near the end of contractions when sales begin to accelerate. Firms decrease 
or increase production to restore their inventory-sales ratios to their desired levels. 


Because hiring and laying off employees have high costs, firms prefer to adjust their 
use of current employees. As a result, firms are slow to lay off employees early in 
contractions and slow to add employees early in expansions. 


Firms use their physical capital more intensively during expansions, investing in 
new capacity only if they believe the expansion is likely to continue. They use 
physical capital less intensively during contractions, but they are more likely to 
reduce capacity by deferring maintenance and not replacing equipment than by 
selling their physical capital. 


Consumer spending fluctuates with the business cycle. Durable goods spending is 
highly sensitive to business cycles, and spending on services is somewhat sensitive, 
but spending on nondurable goods is relatively less sensitive to business cycles. 


The level of activity in the housing sector is affected by mortgage rates, 
demographic changes, the ratio of income to housing prices, and investment or 
speculative demand for homes resulting from recent price trends. 


Domestic imports tend to rise with increases in GDP growth and domestic currency 
appreciation, while increases in foreign incomes and domestic currency 
depreciation tend to increase domestic export volumes. 


Leading indicators have turning points that tend to precede those of the business 
cycle. 


Coincident indicators have turning points that tend to coincide with those of the 
business cycle. 


Lagging indicators have turning points that tend to occur after those of the business 
cycle. 


ANSWER KEY FOR MODULE QUIZZES 


Module Quiz 13.1 


1.C Early in an expansion, inventory-sales ratios typically decrease below their 
normal levels as accelerating sales draw down inventories of produced goods. 
(LOS 13.c) 


2.A An economic contraction is likely to feature increasing unemployment (i.e., 
decreasing employment), along with declining economic output and decreasing 
inflation pressure. (LOS 13.a) 


3.C Credit cycles are associated with interest rates and the availability of credit, 
which is important in the financing of construction and the purchase of 
property. (LOS 13.b) 


4. A Lagging indicators have turning points that occur after business cycle turning 
points. (LOS 13.c) 


5.B Physical capital adjustments to downturns typically are made through aging of 
equipment plus postponing maintenance. (LOS 13.c) 


READING 14 
FISCAL POLICY 


MODULE 14.1: FISCAL POLICY Go 
OBJECTIVES 


Video covering 
this content is 


LOS 14.a: Compare monetary and fiscal policy. available online. 


Fiscal policy refers to a government’s use of spending and taxation to influence 
economic activity. The budget is said to be balanced when tax revenues equal 
government expenditures. A budget surplus occurs when government tax revenues 
exceed expenditures, and a budget deficit occurs when government expenditures 
exceed tax revenues. An increase in the deficit (or a decrease in a surplus) is 
considered expansionary in that it tends to increase GDP. A decrease in a deficit (or 
increase in a surplus) is considered contractionary in that it tends to decrease GDP. 


Monetary policy refers to the central bank’s actions that affect the quantity of 
money and credit in an economy to influence economic activity. Monetary policy is 
said to be expansionary (or accommodative or easy) when the central bank 
increases the quantity of money and credit in an economy. Conversely, when the 
central bank is reducing the quantity of money and credit in an economy, monetary 
policy is said to be contractionary (or restrictive or tight). 


Policymakers use both monetary and fiscal policies with the goals of maintaining 
stable prices and producing positive economic growth. Fiscal policy can also be 
used as a tool for redistribution of income and wealth. 


LOS 14.b: Describe roles and objectives of fiscal policy as well as arguments 
as to whether the size of a national debt relative to GDP matters. 


Objectives of fiscal policy may include the following: 


» Influencing the level of economic activity and aggregate demand 

» Redistributing wealth and income among segments of the population 

» Allocating resources among economic agents and sectors in the economy 

In general, decreased taxes and increased government spending both increase a 


budget deficit, overall demand, economic growth, and employment; increased taxes 
and decreased government spending decrease these. Budget deficits are increased in 


response to recessions, and budget deficits are decreased to slow growth when 
inflation is too high. 


Keynesian economists believe that fiscal policy, through its effect on aggregate 
demand, can have a strong effect on economic growth when the economy is 
operating at less than full employment. Monetarists believe that the effect of fiscal 
stimulus is only temporary, and that monetary policy should be used to increase or 
decrease inflationary pressures over time. Monetarists do not believe that monetary 
policy should be used in an attempt to influence aggregate demand to counter 
cyclical movements in the economy. 


Discretionary fiscal policy refers to the spending and taxing decisions of a national 
government that are intended to stabilize the economy. In contrast, automatic 
stabilizers are built-in fiscal devices triggered by the state of the economy. For 
example, during a recession, tax receipts will fall, and government expenditures on 
unemployment insurance payments will increase. Both of these tend to increase 
budget deficits and are expansionary. Similarly, during boom times, higher tax 
revenues coupled with lower outflows for social programs tend to decrease budget 
deficits and are contractionary. 


When a government runs fiscal deficits, it incurs debt that needs to be repaid as well 
as ongoing interest expense. Total deficits, annual deficits, and interest expense can 
all be evaluated relative to annual GDP. When these ratios increase beyond certain 
levels, it may be a cause for concern, and the solvency of the country may be 
questioned. 


A country’s debt ratio is the ratio of aggregate debt to GDP. Because taxes are linked 
to GDP, when an economy grows in real terms, so will tax revenues. If the real 
interest rate on the government’s debt is higher than the real growth rate of the 
economy, then the debt ratio will increase over time (keeping tax rates constant). 
Similarly, if the real interest rate on the government’s debt is lower than real growth 
in GDP, the debt ratio will decrease (i.e., improve) over time. 


Arguments for being concerned with the size of a fiscal deficit are as follows: 


« Higher deficits lead to higher future taxes. Higher future taxes will lead to 
disincentives to work and entrepreneurship. This leads to lower long-term 
economic growth. 


» If markets lose confidence in the government, investors may not be willing to 
refinance the debt. This can lead to the government defaulting (if debt is ina 
foreign currency) or having to simply print money (if the debt is in local 
currency). Printing money would ultimately lead to higher inflation. 

» Increased government borrowing will tend to increase interest rates, and firms 
may reduce their borrowing and investment spending as a result, thus decreasing 
the impact on aggregate demand of deficit spending. This is referred to as the 
crowding-out effect because government borrowing is taking the place of 
private-sector borrowing. 


Arguments against being concerned with the size of a fiscal deficit are as follows: 


= Ifthe debt is primarily held by domestic citizens, the scale of the problem is 
overstated. 


= Ifthe debt is used to finance productive capital investment, future economic gains 
will be sufficient to repay the debt. 


» Fiscal deficits may prompt needed tax reform. 


« Deficits would not matter if Ricardian equivalence holds, which means private- 
sector savings in anticipation of future tax liabilities just offset the government 
deficit. 


» Ifthe economy is operating at less than full capacity, deficits do not divert capital 
away from productive uses and can aid in increasing GDP and employment. 


=) MODULE QUIZ 14.1 


1. Both monetary and fiscal policy are used to: 
A. balance the budget. 
B. achieve economic targets. 


C. redistribute income and wealth. 

2. Roles and objectives of fiscal policy most likely include: 
A. controlling the money supply to limit inflation. 
B. adjusting tax rates to influence aggregate demand. 
C. using government spending to control interest rates. 


MODULE 14.2: FISCAL POLICY TOOLS CC) 
AND IMPLEMENTATION 


Video covering 
this content is 
available online. 


LOS 14.c: Describe tools of fiscal policy, including their 
advantages and disadvantages. 


Fiscal policy tools include spending tools and revenue tools. 


Spending Tools 

Transfer payments, also known as entitlement programs, redistribute wealth, 
taxing some and making payments to others. Examples include government-run 
retirement income plans (such as Social Security in the United States) and 
unemployment insurance benefits. Transfer payments are not included in GDP 
computations. 


Current spending refers to government purchases of goods and services on an 
ongoing and routine basis. 


Capital spending refers to government spending on infrastructure, such as roads, 
schools, bridges, and hospitals. Capital spending is expected to boost future 
productivity of the economy. 


Spending tools are commonly justified as means of pursuing the following goals: 

» Provide services such as national defense that benefit all the residents in a 
country. 

» Invest in infrastructure to enhance economic growth. 


= Support the country’s growth and unemployment targets by directly affecting 
aggregate demand. 


= Provide a minimum standard of living. 


» Subsidize investment in research and development for certain high-risk ventures 
consistent with future economic growth or other goals (e.g. green technology). 


Revenue Tools 


Direct taxes are levied on income or wealth. These include income taxes, taxes on 
income for national insurance, wealth taxes, estate taxes, corporate taxes, capital 
gains taxes, and Social Security taxes. Some progressive taxes (such as income taxes 
and wealth taxes) collect revenue for wealth and income redistributing. 


Indirect taxes are levied on goods and services. These include sales taxes, value- 
added taxes (VATs), and excise taxes. Indirect taxes can be used to reduce 
consumption of some goods and services (e.g., alcohol, tobacco, gambling). 


Desirable attributes of tax policy are as follows: 


» Simplicity to use and enforce 


« Efficiency, defined here as minimizing interference with market forces and not 
acting as a deterrent to working 


= Fairness is quite subjective, but two of the commonly held beliefs are horizontal 
equality (people in similar situations should pay similar taxes) and vertical 
equality (richer people should pay more in taxes) 


s Sufficiency, in that taxes should generate enough revenues to meet the spending 
needs of the government 


Advantages of fiscal policy tools include: 


= Social policies (e.g, discouraging tobacco use) can be implemented quickly via 
indirect taxes. 

# Quick implementation of indirect taxes also means that government revenues can 
be increased without significant additional costs. 


Disadvantages of fiscal policy tools include: 


« Direct taxes and transfer payments take time to implement, delaying the impact of 
fiscal policy. 

» Capital spending also takes a long time to implement; the economy may have 
recovered by the time its impact is felt. 


Announcing a change in fiscal policy may have significant effects on expectations. 
For example, an announcement of a future increase in taxes may immediately reduce 
current consumption, rapidly producing the desired goal of reducing aggregate 
demand. 


Not all fiscal policy tools affect economic activity equally. Spending tools are most 
effective in increasing aggregate demand. Tax reductions are somewhat less effective, 
as people may not spend the entire amount of the tax savings. Tax reductions for 
those with low incomes will be more effective in increasing aggregate demand, as 
those with lower incomes tend to spend a larger proportion of income on 
consumption; that is, they save a smaller proportion of income and have a higher 
marginal propensity to consume. 


Fiscal Multiplier 


Changes in government spending have magnified effects on aggregate demand 
because those whose incomes increase from increased government spending will, in 
turn, increase their spending, which increases the incomes and spending of others. 
The magnitude of the multiplier effect depends on the tax rate and on the marginal 
propensity to consume. 


To understand the calculation of the multiplier effect, consider an increase in 
government spending of $100 when the MPC is 80% and the tax rate is 25%. The 
increase in spending increases incomes by $100, but $25 (100 x 0.25) of that will be 
paid in taxes. Disposable income is equal to income after taxes, so disposable 
income increases by $100 x (1 - 0.25) = $75. With an MPC of 80%, additional 
spending by those who receive the original $100 increase is $75 x 0.8 = $60. 


This additional spending will increase others’ incomes by $60 and disposable 
incomes by $60 x 0.75 = $45, from which they will spend $45 x 0.8 = $36. 


Because each iteration of this process reduces the amount of additional spending, 
the effect reaches a limit. The fiscal multiplier determines the potential increase in 
aggregate demand resulting from an increase in government spending: 


— 1 
fiscal multiplier 1 — MPC(1 — t) 


Here, with a tax rate of 25% and an MPC of 80%, the fiscal multiplier is 1 / [1 - 0.8(1 
- 0.25)] = 2.5, and the increase of $100 in government spending has the potential to 
increase aggregate demand by $250. 


The fiscal multiplier is inversely related to the tax rate (higher tax rate decreases the 
multiplier) and is directly related to the marginal propensity to consume (higher 
MPC increases the multiplier). 


Balanced Budget Multiplier 


To balance the budget, the government could increase taxes by $100 to just offset a 
$100 increase in spending. Changes in taxes also have a magnified effect on aggregate 
demand. An increase in taxes will decrease disposable income and consumption 
expenditures, thereby decreasing aggregate demand. The initial decrease in spending 
from a tax increase of $100 is 100 x MPC = 100 x 0.8 = $80; beyond that, the 
multiplier effect is the same as we described for a direct increase in government 
spending, and the overall decrease in aggregate demand for a $100 tax increase is 
100(MPC) x fiscal multiplier—or, for our example, 100(0.8)(2.5) = $200. 


Combining the total increase in aggregate demand from a $100 increase in 
government spending with the total decrease in aggregate demand from a $100 tax 
increase shows that the net effect on aggregate demand of both is an increase of 
$250 - $200 = $50, so we can say that the balanced budget multiplier is positive. 


If, instead of a $100 increase in taxes, we increased taxes by 100 / MPC = 100 / 0.8 = 
$125 and increased government spending by $100, the net effect on aggregate 
demand would be zero. 


Ricardian Equivalence 


Increases in the current deficit mean greater taxes in the future. To maintain their 
preferred pattern of consumption over time, taxpayers may increase current savings 
(reduce current consumption) to offset the expected cost of higher future taxes. If 
taxpayers reduce current consumption and increase current saving by just enough to 
repay the principal and interest on the debt the government issued to fund the 
increased deficit, there is no effect on aggregate demand. This is known as Ricardian 
equivalence, after economist David Ricardo. If taxpayers underestimate their future 
liability for servicing and repaying the debt so that aggregate demand is increased 
by equal spending and tax increases, Ricardian equivalence does not hold. Whether 
it does is an open question. 


LOS 14.d: Explain the implementation of fiscal policy and difficulties of 
implementation as well as whether a fiscal policy is expansionary or 
contractionary. 


Fiscal policy is implemented through changes in taxes and spending. This is called 
discretionary fiscal policy (as opposed to automatic stabilizers, discussed 
previously). Discretionary fiscal policy would be designed to be expansionary when 
the economy is operating below full employment. Fiscal policy aims to stabilize 
aggregate demand. During recessions, actions can be taken to increase government 
spending or decrease taxes. Either change tends to strengthen the economy by 
increasing aggregate demand, putting more money in the hands of corporations and 
consumers to invest and spend. During inflationary economic booms, actions can be 
taken to decrease government spending or increase taxes. Either change tends to 
slow the economy by decreasing aggregate demand, taking money out of the hands 
of corporations and consumers and causing both investment and consumption 
spending to decrease. 


Discretionary fiscal policy is not an exact science. First, economic forecasts might 
be wrong, leading to incorrect policy decisions. Second, complications arise in 
practice that delay both the implementation of discretionary fiscal policy and the 
impact of policy changes on the economy. The lag between recessionary or 
inflationary conditions in the economy and the impact on the economy of fiscal 
policy changes can be divided into three types: 


1. Recognition lag. Discretionary fiscal policy decisions are made by a political 
process. The state of the economy is complex, and it may take policymakers time 
to recognize the nature and extent of the economic problems. 

2. Action lag. Governments take time to discuss, vote on, and enact fiscal policy 
changes. 

3. Impact lag. Time elapses between the enactment of fiscal policy changes and 
when the impact of the changes on the economy actually takes place. It takes time 
for corporations and individuals to act on the fiscal policy changes, and fiscal 
multiplier effects occur only over time as well. 


These lags can actually make fiscal policy counterproductive. For example, if the 
economy is in a recession phase, fiscal stimulus may be deemed appropriate. 


However, by the time fiscal stimulus is implemented and has its full impact, the 
economy may already be on a path to a recovery driven by the private sector. 


Additional macroeconomic issues may hinder the usefulness of fiscal policy: 


« Misreading economic statistics. The full employment level for an economy is not 
precisely measurable. If the government relies on expansionary fiscal policy 
mistakenly at a time when the economy is already at full capacity, it will simply 
drive inflation higher. 


= Crowding-out effect. Expansionary fiscal policy may crowd out private investment. 
Greater government borrowing tends to increase interest rates, which decreases 
private investments (some investments will be uneconomic with higher 
borrowing costs). This crowding-out effect can reduce the impact of expansionary 
fiscal policy on aggregate demand. Opinions vary on the magnitude of this effect. 


« Supply shortages. If economic activity is slow due to resource constraints (low 
availability of labor or other resources) and not due to low demand, expansionary 
fiscal policy will fail to achieve its objective and will probably lead to higher 
inflation. 


« Limits to deficits. There is a limit to expansionary fiscal policy. If the markets 
perceive that the deficit is already too high as a proportion of GDP, funding the 
deficit will be problematic. This could lead to higher interest rates and actually 
make the situation worse. 


« Multiple targets. If the economy has high unemployment coupled with high 
inflation, fiscal policy cannot address both problems simultaneously. 


To determine if fiscal policy is expansionary or contractionary, economists often 

focus on changes in the budget surplus or deficit. An increase (decrease) in surplus 
is indicative of a contractionary (expansionary) fiscal policy. Similarly, an increase 
(decrease) in deficit is indicative of an expansionary (contractionary) fiscal policy. 


PROFESSOR’S NOTE 

For the exam, an increase (decrease) in a revenue item (e.g., sales tax) 
should be considered contractionary (expansionary), and an increase 
(decrease) in a spending item (e.g., construction of highways) should be 
considered expansionary (contractionary). 


A government’s intended fiscal policy is not necessarily obvious from just 
examining the current deficit. Consider an economy that is in recession so that 
transfer payments are increased and tax revenue is decreased, leading to a deficit. 
This does not necessarily indicate that fiscal policy is expansionary as, at least to 
some extent, the deficit is a natural outcome of the recession without any explicit 
action of the government. Economists often use a measure called the structural 
budget deficit (or cyclically adjusted budget deficit) to gauge fiscal policy. This is 
the deficit that would occur based on current policies if the economy were at full 
employment. 


=I MODULE QUIZ 14.2 
he 
1. A government enacts a program to subsidize farmers with an expansive spending 


program of $10 billion. At the same time, the government enacts a $10 billion tax 
increase over the same period. Which of the following statements best describes 


the impact on aggregate demand? 
A. Lower growth, because the tax increase will have a greater effect. 
B. No effect, because the tax and spending effects just offset each other. 
C. Higher growth, because the spending increase will have a greater effect. 
2. A government reduces spending by $50 million. The tax rate is 30%, and 
consumers exhibit a marginal propensity to consume of 80%. The change in 
aggregate demand caused by the change in government spending is closest to: 
A. -$66 million. 
B. —$114 million. 
C. —$250 million. 
3. The size of a national debt is most likely to be a concern for policymakers if: 
A. Ricardian equivalence holds. 
B. a crowding-out effect occurs. 
C. debt is used to finance capital growth. 


4. A government is concerned about the timing of the impact of fiscal policy changes 
and is considering requiring the compilation and reporting of economic statistics 
weekly, rather than quarterly. The new reporting frequency is intended to 
decrease the: 

A. action lag. 
B. impact lag. 
C. recognition lag. 


5. Fiscal policy is most likely to be expansionary if tax rates: 
A. and government spending both decrease. 
B. decrease and government spending increases. 
C. increase and government spending decreases. 


KEY CONCEPTS 


LOS 14.a 


Fiscal policy is a government’s use of taxation and spending to influence the 
economy. Monetary policy deals with determining the quantity of money supplied 
by the central bank. Both policies aim to achieve economic growth with price level 
stability, although governments use fiscal policy for social and political reasons as 
well. 


LOS 14.b 

Objectives of fiscal policy can include (1) influencing the level of economic activity, 

(2) redistributing wealth or income, and (3) allocating resources among industries. 

There are arguments for being concerned with the size of a fiscal deficit: 

« Higher future taxes lead to disincentives to work, negatively affecting long-term 
economic growth. 

» Fiscal deficits may not be financed by the market when debt levels are high. 


» There is a crowding-out effect as government borrowing increases interest rates 
and decreases the private-sector investment. 


There are arguments against being concerned with the size of a fiscal deficit: 


= Debt may be financed by domestic citizens. 


= Deficits for capital spending can boost the productive capacity of the economy. 


» Fiscal deficits may prompt needed tax reform. 


» Ricardian equivalence may prevail: private savings rise in anticipation of the need 
to repay principal on government debt. 


=» When the economy is operating below full employment, deficits do not crowd out 
private investment. 


LOS 14.c 


Fiscal policy tools include spending tools and revenue tools. Spending tools include 
transfer payments, current spending (goods and services used by government), and 
capital spending (investment projects funded by government). Revenue tools include 
direct and indirect taxation. 


An advantage of fiscal policy is that indirect taxes can be used to quickly implement 
social policies and can also be used to quickly raise revenues at a low cost. 


Disadvantages of fiscal policy include time lags for implementing changes in direct 
taxes and time lags for capital spending changes to have an impact. 


LOS 14.d 


Fiscal policy is implemented by government changes in taxing and spending policies. 
Delays in realizing the effects of fiscal policy changes limit their usefulness. Delays 
can be caused by the following: 


« Recognition lag. Policymakers may not immediately recognize when fiscal policy 
changes are needed. 


» Action lag. Governments take time to enact needed fiscal policy changes. 


« Impact lag. Fiscal policy changes take time to affect economic activity. 


In general, fiscal policies that increase the government deficit are often considered 
expansionary, and policies that decrease the deficit are considered contractionary. 
However, because a deficit will naturally increase during a recession and decrease 
during an expansion, the impact of a current deficit is often judged relative to a 
structural (cyclically adjusted) deficit amount. 


ANSWER KEY FOR MODULE QUIZZES 


Module Quiz 14.1 


1.B Both monetary and fiscal policies primarily strive to achieve economic targets 
such as inflation and GDP growth. Balancing the budget is not a goal for 
monetary policy and is a potential outcome of fiscal policy. Fiscal policy (but 
not monetary policy) may secondarily be used as a tool to redistribute income 
and wealth. (LOS 14.a) 


2.B Influencing the level of aggregate demand through taxation and government 
spending is an objective of fiscal policy. Controlling inflation and interest rates 
are typical objectives of monetary policy. (LOS 14.b) 


Module Quiz 14.2 


LC 


2.B 


3.B 


4.C 


5.B 


The amount of the spending program exactly offsets the amount of the tax 
increase, leaving the budget unaffected. The multiplier for government 
spending is greater than the multiplier for a tax increase. Therefore, the 
balanced budget multiplier is positive. All of the government spending enters 
the economy as increased expenditure, whereas spending is reduced by only a 
portion of the tax increase. (LOS 14.c) 


fiscal multiplier = 1 / [1 - MPC (1 - T)] =1/ [1 - 0.80(1 - 0.3)] = 2.27 
change in government spending = -$50 million 
change in aggregate demand = -(50 x 2.27) = -$113.64 million 


(LOS 14.c) 


Crowding out refers to the possibility that government borrowing causes 
interest rates to increase and private investment to decrease. If government 
debt is financing the growth of productive capital, this should increase future 
economic growth and tax receipts to repay the debt. Ricardian equivalence is 
the theory that if government debt increases, private citizens will increase 
savings in anticipation of higher future taxes, and it is an argument against 
being concerned about the size of government debt and budget deficits. (LOS 
14.d) 


More frequent and current economic data would make it easier for authorities 
to monitor the economy and to recognize problems. The reduction in the time 
between economic reports should reduce the recognition lag. (LOS 14.d) 


Increases in government spending and decreases in taxes are expansionary 


fiscal policy. Decreases in spending and increases in taxes are contractionary 
fiscal policy. (LOS 14.d) 


READING 15 
MONETARY POLICY 


MODULE 15.1: CENTRAL BANK Go 
OBJECTIVES AND TOOLS 


Video covering 
this content is 


LOS 15.a: Describe the roles and objectives of central banks. available online. 


There are several key roles of central banks: 


1. Sole supplier of currency. Central banks have the sole authority to supply money. 
Traditionally, such money was backed by gold; the central bank stood ready to 
convert the money into a prespecified quantity of gold. Later on, the gold backing 
was removed, and money supplied by the central bank was deemed legal tender 
by law. Money not backed by any tangible value is termed fiat money. As long as 
fiat money holds its value over time and is acceptable for transactions, it can 
continue to serve as a medium of exchange. 


2. Banker to the government and other banks. Central banks provide banking services 
to the government and other banks in the economy. 


Ww 


. Regulator and supervisor of payments system. In many countries, central banks 
may regulate the banking system by imposing standards of risk-taking allowed 
and reserve requirements of banks under its jurisdiction. Central banks also 
oversee the payments system to ensure smooth clearing operations domestically 
and in conjunction with other central banks for international transactions. 


4. Lender of last resort. Central banks’ ability to print money allows them to supply 
money to banks that are experiencing shortages. This government backing tends 
to prevent runs on banks (i.e., large-scale withdrawals) by assuring depositors that 
their funds are secure. 


5. Holder of gold and foreign exchange reserves. Central banks are often the 
repositories of the nation’s gold and reserves of foreign currencies. 


6. Conductor of monetary policy. Central banks control or influence the quantity of 
money supplied in an economy and growth of money supply over time. 


The primary objective of a central bank is to control inflation so as to promote price 
stability. High inflation is not conducive to a stable economic environment. High 
inflation leads to menu costs (i.e., cost to businesses of constantly having to change 
their prices) and shoe leather costs (i.e., costs to individuals of making frequent 
trips to the bank so as to minimize their holdings of cash that are depreciating in 
value due to inflation). 


In addition to price stability, some central banks have other stated goals: 


= Stability in exchange rates with foreign currencies 
= Full employment 
» Sustainable positive economic growth 


= Moderate long-term interest rates 


The target inflation rate in most developed countries is a range around 2% to 3%. A 
target of zero inflation is not used because that increases the risk of deflation, which 
can be disruptive for an economy. 


While most developed countries have an explicit target inflation rate, the U.S. Fed 
and the Bank of Japan do not. In the United States, this is because the Fed has the 
additional goals of maximum employment and moderate long-term interest rates. In 
Japan, it is because deflation, rather than inflation, has been a persistent problem in 
recent years. 


Some developed countries, and several developing countries, choose a target level 
for the exchange rate of their currency with that of another country, primarily the 
US. dollar. This is referred to as pegging their exchange rate with the dollar. If their 
currency appreciates (i.e., becomes relatively more valuable), they can sell their 
domestic currency reserves for dollars to reduce the exchange rate. While such 
actions may be effective in the short run, for stability of the exchange rate over time, 
the monetary authorities in the pegging country must manage interest rates and 
economic activity to achieve their goal. This can lead to increased volatility of their 
money supply and interest rates. The pegging country essentially commits to a 
policy intended to make its inflation rate equal to the inflation rate of the country to 
which it pegs its currency. 


LOS 15.b: Describe tools used to implement monetary policy tools and the 
monetary transmission mechanism, and explain the relationships between 
monetary policy and economic growth, inflation, interest, and exchange rates. 


Monetary policy is implemented using the monetary policy tools of the central 
bank. The three main policy tools of central banks are as follows: 


1. Policy rate. In the United States, banks can borrow funds from the Fed if they have 
temporary shortfalls in reserves. The rate at which banks can borrow reserves 
from the Fed is called the discount rate. For the European Central Bank (ECB), it is 
called the refinancing rate. 

One way to lend money to banks is through a repurchase agreement. The central 
bank purchases securities from banks that, in turn, agree to repurchase the 
securities at a higher price in the future. The percentage difference between the 
purchase price and the repurchase price is effectively the rate at which the central 
bank is lending to member banks. The Bank of England uses this method, and its 
policy rate is called the two-week repo (repurchase) rate. A lower rate reduces 
banks’ cost of funds, encourages lending, and tends to decrease interest rates 
overall. A higher policy rate has the opposite effect, decreasing lending and 
increasing interest rates. 


In the United States, the federal funds rate is the rate that banks charge each other 
on overnight loans of reserves. The Fed sets a target for this market-determined 
rate and uses open market operations to influence it toward the target rate. 


2. Reserve requirements. By increasing the reserve requirement (the percentage of 
deposits that banks are required to retain as reserves), the central bank effectively 
decreases the funds that are available for lending and thereby decreases the 
money supply, which will tend to increase interest rates. A decrease in the reserve 
requirement will increase the funds available for lending and the money supply, 
which will tend to decrease interest rates. This tool only works well to increase 
the money supply if banks are willing to lend and customers are willing to borrow. 


3. Open market operations. Buying and selling of securities by the central bank is 
referred to as open market operations. When the central bank buys securities, cash 
replaces securities in investor accounts, banks have excess reserves, more funds 
are available for lending, the money supply increases, and interest rates decrease. 
Sales of securities by the central bank have the opposite effect, reducing cash in 
investor accounts, excess reserves, funds available for lending, and the money 
supply, which will tend to cause interest rates to increase. In the United States, 
open market operations are the Fed’s most commonly used tool and are important 
in achieving the federal funds target rate. 


Monetary Transmission Mechanism 


The monetary transmission mechanism refers to the ways in which a change in 
monetary policy, specifically the central bank’s policy rate, affects the price level 
and inflation. A change in the policy rates that the monetary authorities control 
directly is transmitted to prices through four channels: other short-term rates, asset 
values, currency exchange rates, and expectations. 


We can examine the transmission mechanism in more detail by considering the 
effects of a change to a contractionary monetary policy implemented through an 
increase in the policy rate: 


» Banks’ short-term lending rates will increase in line with the increase in the policy 
rate. The higher rates will decrease aggregate demand as consumers reduce credit 
purchases and businesses cut back on investment in new projects. 


= Bond prices, equity prices, and asset prices in general will decrease as the discount 
rates applied to future expected cash flows are increased. This may have a wealth 
effect because a decrease in the value of households’ assets may increase the 
savings rate and decrease consumption. 


=» Both consumers and businesses may decrease their expenditures because their 
expectations for future economic growth decrease. 


« The increase in interest rates may attract foreign investment in debt securities, 
leading to an appreciation of the domestic currency relative to foreign currencies. 
An appreciation of the domestic currency increases the foreign currency prices of 
exports and can reduce demand for the country’s export goods. 


Taken together, these effects act to decrease aggregate demand and put downward 
pressure on the price level. A decrease in the policy rate would affect the price level 
through the same channels, but in the opposite direction. 


Monetary Policy Relation With Economic Growth, 
Inflation, Interest, and Exchange Rates 


If money neutrality holds, changes in monetary policy and the policy rate will have 
no effect on real output. In the short run, however, changes in monetary policy can 
affect real economic growth as well as interest rates, inflation, and foreign exchange 
rates. The effects of a change to a more expansionary monetary policy may include 
any or all of the following: 


The central bank buys securities, which increases bank reserves. 


The interbank lending rate decreases as banks are more willing to lend each other 
reserves. 


Other short-term rates decrease as the increase in the supply of loanable funds 
decreases the equilibrium rate for loans. 


Longer-term interest rates also decrease. 


The decrease in real interest rates causes the currency to depreciate in the foreign 
exchange market. 


The decrease in long-term interest rates increases business investments in plant 
and equipment. 


Lower interest rates cause consumers to increase their purchases of houses, autos, 
and durable goods. 


Depreciation of the currency increases foreign demand for domestic goods. 


The increases in consumption, investment, and net exports all increase aggregate 
demand. 


The increase in aggregate demand increases inflation, employment, and real GDP. 


= MODULE QUIZ 15.1 


» 
1. A central bank’s policy goals least likely include: 
A. price stability. 
B. minimizing long-term interest rates. 
C. maximizing the sustainable growth rate of the economy. 


2. A country that targets a stable exchange rate with another country’s currency 
least likely: 
A. accepts the inflation rate of the other country. 
B. will sell its currency if its foreign exchange value rises. 
C. must also match the money supply growth rate of the other country. 


3. If a country’s inflation rate is below the central bank’s target rate, the central 
bank is most likely to: 
A. sell government securities. 
B. increase the reserve requirement. 
C. decrease the overnight lending rate. 


4. A central bank conducts monetary policy primarily by altering the: 
A. policy rate. 
B. inflation rate. 
C. long-term interest rate. 


5. Purchases of securities in the open market by the monetary authorities are least 
likely to increase: 
A. excess reserves. 


B. cash in investor accounts. 
C. the interbank lending rate. 


6. An increase in the policy rate will most likely lead to an increase in: 
A. business investments in fixed assets. 
B. consumer spending on durable goods. 
C. the foreign exchange value of the domestic currency. 


MODULE 15.2: MONETARY POLICY Cc) 
EFFECTS AND LIMITATIONS 


Video covering 
this content is 


. - : available online. 
LOS 15.c: Describe qualities of effective central banks; contrast 


their use of inflation, interest rate, and exchange rate targeting 
in expansionary or contractionary monetary policy; and describe the 
limitations of monetary policy. 


For a central bank to succeed in its inflation targeting policies, it should have three 
essential qualities: 


1. Independence. For a central bank to be effective in achieving its goals, it should be 
free from political interference. Reducing the money supply to reduce inflation 
can also be expected to decrease economic growth and employment. The political 
party in power has an incentive to boost economic activity and reduce 
unemployment before elections. For this reason, politicians may interfere with the 
central bank’s activities, compromising its ability to manage inflation. 
Independence should be thought of in relative terms (degrees of independence) 
rather than absolute terms. Even in the case of relatively independent central 
banks, the heads of the banks may be appointed by politicians. 

Independence can be evaluated based on both operational independence and 
target independence. Operational independence means that the central bank is 
allowed to independently determine the policy rate. Target independence means 
the central bank also defines how inflation is computed, sets the target inflation 
level, and determines the horizon over which the target is to be achieved. The ECB 
has both target and operational independence, while most other central banks 
have only operational independence. 


2. Credibility. To be effective, central banks should follow through on their stated 
intentions. If a government with large debts, instead of a central bank, sets an 
inflation target, the target would not be credible because the government has an 
incentive to allow inflation to exceed the target level. On the other hand, a credible 
central bank’s targets can become self-fulfilling prophecies. If the market believes 
that a central bank is serious about achieving a target inflation rate of 3%, wages 
and other nominal contracts will be based on 3% inflation, and actual inflation 
will then be close to that level. 

3. Transparency. Transparency on the part of central banks aids their credibility. 
Transparency means that central banks periodically disclose the state of the 
economic environment by issuing inflation reports. Transparent central banks 
periodically report their views on the economic indicators and other factors they 
consider in their interest rate setting policy. When a central bank makes clear the 


economic indicators it uses in establishing monetary policy and how they will be 
used, it not only gains credibility, but also makes policy changes easier to 
anticipate and implement. 


Inflation, Interest Rate, and Exchange Rate 
Targeting by Central Banks 


Central banks have used various economic variables and indicators over the years to 
make monetary policy decisions. In the past, some have used interest rate 
targeting, increasing the money supply when specific interest rates rose above the 
target band and decreasing the money supply (or the rate of money supply growth) 
when interest rates fell below the target band. Currently, inflation targeting is the 
most widely used tool for making monetary policy decisions, and it is the method 
required by law in some countries. Central banks that currently use inflation 
targeting include the United Kingdom, Brazil, Canada, Australia, Mexico, and the 
ECB. 


The most common inflation rate target is 2%, with a permitted deviation of +1% so 
the target band is 1% to 3%. The reason the inflation target is not 0% is that 
variations around that rate would allow for negative inflation (i.e., deflation), which 
is considered disruptive to the smooth functioning of an economy. Central banks are 
not necessarily targeting current inflation, which is the result of prior policy and 
events, but inflation in the range of two years in the future. 


Some countries, especially developing countries, use exchange rate targeting. That 
is, they target a foreign exchange rate between their currency and another (often the 
US. dollar), rather than targeting inflation. As an example, consider a country that 
has targeted an exchange rate for its currency versus the U.S. dollar. If the foreign 
exchange value of the domestic currency falls relative to the U.S. dollar, the 
monetary authority must use foreign reserves to purchase their domestic currency 
(which will reduce money supply growth and increase interest rates) to reach the 
target exchange rate. Conversely, an increase in the foreign exchange value of the 
domestic currency above the target rate will require sale of the domestic currency 
in currency markets to reduce its value (increasing the domestic money supply and 
decreasing interest rates) to move toward the target exchange rate. One result of 
exchange rate targeting may be greater volatility of the money supply because 
domestic monetary policy must adapt to the necessity of maintaining a stable 
foreign exchange rate. 


Over the short term, the targeting country can purchase or sell its currency in the 
foreign exchange markets to influence the exchange rate. There are limits, however, 
on how much influence currency purchases or sales can have on exchange rates over 
time. For example, a country may run out of foreign reserves with which to purchase 
its currency when the exchange value of its currency is still below the target 
exchange rate. 


The net effect of exchange rate targeting is that the targeting country will have the 
same inflation rate as the country with the targeted currency, and the targeting 
country will need to follow monetary policy and accept interest rates that are 
consistent with this goal regardless of domestic economic circumstances. 


Limitations of Monetary Policy 


This transmission mechanism for monetary policy previously described does not 
always produce the intended results. In particular, long-term rates may not rise and 
fall with short-term rates because of the effect of monetary policy changes on 
expected inflation. 


If individuals and businesses believe that a decrease in the money supply intended 
to reduce inflation will be successful, they will expect lower future inflation rates. 
Because long-term bond yields include a premium for expected inflation, long-term 
rates could fall (tending to increase economic growth), even while the central bank 
has increased short-term rates to slow economic activity. Conversely, increasing the 
money supply to stimulate economic activity could lead to an increase in expected 
inflation rates and long-term bond yields, even as short-term rates fall. 


From a different perspective, monetary tightening may be viewed as too extreme— 
increasing the probability of a recession, making long-term bonds more attractive, 
and reducing long-term interest rates. If money supply growth is seen as 
inflationary, higher expected future asset prices will make long-term bonds 
relatively less attractive and will increase long-term interest rates. Bond market 
participants that act in this way have been called bond market vigilantes. When 
the central bank’s policy is credible and investors believe that the inflation target 
rate will be maintained over time, this effect on long-term rates will be small. 


Another situation in which the transmission mechanism may not perform as 
expected is if demand for money becomes very elastic and individuals willingly hold 
more money, even without a decrease in short-term rates. Such a situation is called a 
liquidity trap. Increasing the growth of the money supply will not decrease short- 
term rates under these conditions because individuals hold the money in cash 
balances instead of investing in interest-bearing securities. If an economy is 
experiencing deflation even though monetary policy has been expansionary, 
liquidity trap conditions may be present. 


Compared to inflation, deflation is more difficult for central banks to reverse. Ina 
deflationary environment, monetary policy needs to be expansionary. However, the 
central bank is limited to reducing the nominal policy rate to zero. Once it reaches 
zero, the central bank has a limited ability to further stimulate the economy. 


Another reason standard tools for increasing the money supply might not increase 
economic activity is that even with increasing excess reserves, banks may not be 
willing to lend. When what has become known as the credit bubble collapsed in 
2008, banks around the world lost equity capital and desired to rebuild it. For this 
reason, they decreased their lending even as money supplies were increased, and 
short-term rates fell. With short-term rates near zero, economic growth still poor, 
and a threat of deflation, central banks began a policy termed quantitative easing. 


In the United Kingdom, quantitative easing entailed large purchases of British 
government bonds in the maturity range of three to five years. The intent was to 
reduce interest rates to encourage borrowing and to generate excess reserves in the 
banking system to encourage lending. Uncertainty about the economy’s future 
caused banks to behave quite conservatively and willingly hold more excess 
reserves, rather than make loans. 


In the United States, billions of dollars were made available for the Fed to buy assets 
other than short-term Treasury securities. Large amounts of mortgage securities 
were purchased from banks to encourage bank lending and to reduce mortgage rates 
in an attempt to revive the housing market, which had collapsed. When this program 
did not have the desired effect, a second round of quantitative easing (QE2) was 
initiated. The Fed purchased long-term Treasury bonds in large quantities (hundreds 
of billions of dollars) with the goal of bringing down longer-term interest rates and 
generating excess reserves to increase lending and economic growth. The Fed has 
also purchased securities with credit risk as part of its quantitative easing, 
improving banks’ balance sheets but perhaps shifting risk from the private sector to 
the public sector. 


Monetary Policy in Developing Economies 


Developing countries face problems in successfully implementing monetary policy. 
Without a liquid market in their government debt, interest rate information may be 
distorted and open market operations difficult to implement. In a very rapidly 
developing economy, it may be quite difficult to determine the neutral rate of 
interest for policy purposes. Rapid financial innovation may change the demand to 
hold monetary aggregates. Central banks may lack credibility because of past 
failure(s) to maintain inflation rates in a target band and might not be given 
independence by the political authority. 


LOS 15.d: Explain the interaction of monetary and fiscal policy. 


Monetary policy and fiscal policy may each be either expansionary or 
contractionary, so there are four possible scenarios: 


1. Expansionary fiscal and monetary policy. In this case, the impact will be highly 
expansionary, taken together. Interest rates will usually be lower (due to monetary 
policy), and the private and public sectors will both expand. 


2. Contractionary fiscal and monetary policy. In this case, aggregate demand and 
GDP would be lower, and interest rates would be higher due to tight monetary 
policy. Both the private and public sectors would contract. 


3. Expansionary fiscal policy and contractionary monetary policy. In this case, 
aggregate demand will likely be higher (due to fiscal policy), while interest rates 
will be higher (due to increased government borrowing and tight monetary 
policy). Government spending as a proportion of GDP will increase. 


4. Contractionary fiscal policy and expansionary monetary policy. In this case, 
interest rates will fall from decreased government borrowing and from the 
expansion of the money supply, increasing both private consumption and output. 
Government spending as a proportion of GDP will decrease due to contractionary 
fiscal policy. The private sector would grow as a result of lower interest rates. 


Not surprisingly, the fiscal multipliers for different types of fiscal stimulus differ, and 
the effects of expansionary fiscal policy are greater when it is combined with 
expansionary monetary policy. The fiscal multiplier for direct government spending 
increases has been much higher than the fiscal multiplier for increases in transfers 


to individuals or tax reductions for workers. Transfer payments to the poor have the 
greatest relative impact, followed by tax cuts for workers, and broader-based 
transfers to individuals (not targeted). For all types of fiscal stimulus, the impact is 
greater when the fiscal actions are combined with expansionary monetary policy. 
This may reflect the impact of greater inflation, falling real interest rates, and the 
resulting increase in business investments. 


=I MODULE QUIZ 15.2 
=k 
1. Qualities of effective central banks include: 
A. credibility and verifiability. 
B. comparability and relevance. 
C. independence and transparency. 
2. Monetary policy is likely to be least responsive to domestic economic conditions if 
policymakers employ: 
A. inflation targeting. 
B. interest rate targeting. 
C. exchange rate targeting. 
3. Monetary policy is most likely to fail to achieve its objectives when the economy is: 
A. growing rapidly. 
B. experiencing deflation. 
C. experiencing disinflation. 


KEY CONCEPTS 


LOS 15.a 


Central bank roles include supplying currency, acting as a banker to the government 
and to other banks, regulating and supervising the payments system, acting as a 
lender of last resort, holding the nation’s gold and foreign currency reserves, and 
conducting monetary policy. 


Central banks have the objective of controlling inflation, and some have additional 
goals of maintaining currency stability, full employment, positive sustainable 
economic growth, or moderate interest rates. 


LOS 15.b 


Policy tools available to central banks include the policy rate, reserve requirements, 
and open market operations. The policy rate is called the discount rate in the United 
States, the refinancing rate by the ECB, and the two-week repo rate in the United 
Kingdom. 


Decreasing the policy rate, decreasing reserve requirements, and making open 
market purchases of securities are all expansionary. Increasing the policy rate, 
increasing reserve requirements, and making open market sales of securities are all 
contractionary. 


The transmission mechanism for changes in the central bank’s policy rate through to 
prices and inflation includes one or more of the following: 


» Short-term bank lending rates 


» Asset prices 


» Expectations for economic activity and future policy rate changes 


» Exchange rates with foreign currencies 


A contractionary monetary policy (increase in policy rate) will tend to decrease 
economic growth, increase market interest rates, decrease inflation, and lead to 
appreciation of the domestic currency in foreign exchange markets. An expansionary 
monetary policy (decrease in policy rate) will have opposite effects, tending to 
increase economic growth, decrease market interest rates, increase inflation, and 
reduce the value of the currency in foreign exchange markets. 


LOS 15.c 
Effective central banks exhibit independence, credibility, and transparency. 


» Independence. The central bank is free from political interference. 
= Credibility. The central bank follows through on its stated policy intentions. 


= Transparency. The central bank makes it clear what economic indicators it uses 
and reports on the state of those indicators. 


Most central banks set target inflation rates, typically 2%-3%, rather than targeting 
interest rates, as was once common. When inflation is expected to rise above (fall 
below) the target band, the money supply is decreased (increased) to reduce 
(increase) economic activity. 


Developing economies sometimes target a stable exchange rate for their currency 
relative to that of a developed economy, selling their currency when its value rises 
above the target rate and buying their currency with foreign reserves when the rate 
falls below the target. The developing country must follow a monetary policy that 
supports the target exchange rate and essentially commits to having the same 
inflation rate as the developed country. 


Reasons that monetary policy may not work as intended: 
=» Monetary policy changes may affect inflation expectations to such an extent that 
long-term interest rates move opposite to short-term interest rates. 


= Individuals may be willing to hold greater cash balances without a change in 
short-term rates (liquidity trap). 


» Banks may be unwilling to lend greater amounts, even when they have increased 
excess reserves. 


» Short-term rates cannot be reduced below zero. 

= Developing economies face unique challenges in using monetary policy due to 
undeveloped financial markets, rapid financial innovation, and lack of credibility 
of the monetary authority. 

LOS 15.d 


Interaction of monetary and fiscal policies: 


Monetary Fiscal |Interest Output Private Public 
Policy Policy | Rates Sector Sector 
Spending Spending 


Tight Tight | Higher Lower Lower Lower 
Easy Easy Lower Higher Higher Higher 
Tight Easy Higher Higher Lower Higher 
Easy Tight | Lower Varies Higher Lower 


ANSWER KEY FOR MODULE QUIZZES 


Module Quiz 15.1 


1.B Central bank goals often include the following: maximum employment, which is 
interpreted as the maximum sustainable growth rate of the economy; stable 
prices; and moderate (not minimum) long-term interest rates. (LOS 15.a) 


2.C The money supply growth rate may need to be adjusted to keep the exchange 
rate within acceptable bounds, but it is not necessarily the same as that of the 
other country. The other two statements are true. (LOS 15.a) 


3.C Decreasing the overnight lending rate would add reserves to the banking 
system, which would encourage bank lending, expand the money supply, reduce 
interest rates, and allow GDP growth and the rate of inflation to increase. 
Selling government securities or increasing the reserve requirement would 
have the opposite effect, reducing the money supply and decreasing the 
inflation rate. (LOS 15.b) 


4.A The primary method by which a central bank conducts monetary policy is 
through changes in the target short-term rate or policy rate. (LOS 15.b) 


5.C Open market purchases by monetary authorities decrease the interbank lending 
rate by increasing excess reserves that banks can lend to one another—and 
therefore, increasing their willingness to lend. (LOS 15.b) 


6.C An increase in the policy rate is likely to increase longer-term interest rates, 
causing decreases in consumption spending on durable goods and business 
investments in plant and equipment. The increase in rates, however, makes 
investment in the domestic economy more attractive to foreign investors, 
increasing demand for the domestic currency and causing the currency to 
appreciate. (LOS 15.b) 


Module Quiz 15.2 


1.C The three qualities of effective central banks are independence, credibility, and 
transparency. (LOS 15.c) 


2.C Exchange rate targeting requires monetary policy to be consistent with the goal 
of a stable exchange rate with the targeted currency, regardless of domestic 
economic conditions. (LOS 15.c) 


3.B Monetary policy has a limited ability to act effectively against deflation 
because the policy rate cannot be reduced below zero, and demand for money 


may be highly elastic (liquidity trap). (LOS 15.c) 


READING 16 
INTRODUCTION TO GEOPOLITICS 


MODULE 16.1: GEOPOLITICS cG 


‘ ne é Video covering 
LOS 16.a: Describe geopolitics from a cooperation versus aeacoukene ic 


competition perspective. available online. 


Geopolitics refers to interactions among nations, including the actions of state 
actors (national governments) and nonstate actors (corporations, nongovernment 
organizations, and individuals). 


Geopolitics also refers to the study of how geography affects interactions among 
nations and their citizens. For example, firms located in coastal countries naturally 
tend to be the dominant participants in international shipping. 


One way to examine geopolitics is through analysis of the extent to which individual 
countries cooperate with one another. Potential areas for cooperation include 
diplomatic and military matters, and economic and cultural interactions. In terms of 
economics, areas of cooperation include freedom of movement across borders for 
goods, services, and capital; agreements to harmonize tariffs; international 
standardization of rules; and transfers of information and technology. 


While a country that engages with other countries on these matters may be 
considered cooperative and one that does not may be considered noncooperative, 
the extent of cooperation actually varies along a spectrum. A country might be more 
cooperative on some issues and less cooperative on others, and its degree of 
cooperation can change over time or with the outcomes of the country’s domestic 
politics. A country’s current decision makers and the length of its political cycle are 
factors to consider when analyzing geopolitics. 


A country will typically cooperate with other countries when doing so advances its 
national interests. For example, a country may cooperate with its neighbors ina 
military alliance if doing so will further its interests in protecting its citizens from 
foreign invaders. 


We can analyze a country’s national interests as a hierarchy, with its top priorities 
being those that ensure its survival. A country’s geophysical resource endowment 
may influence its priorities. For example, a country that has mineral resources but 
lacks arable land needs to trade minerals for food; therefore, it has an interest in 
cooperating with other countries to keep international trade lanes open. 


Nonstate actors often have interests in cooperating across borders. Individuals and 
firms seek to direct their resources to their highest-valued uses, and some of those 
uses may be in other countries. To facilitate the flow of resources, state and nonstate 
actors may cooperate on standardization of regulations and processes. One key 
example of standardization among countries is International Financial Reporting 
Standards for firms presenting their accounting data to the public, which we will 
examine in the Financial Statement Analysis topic area. 


Cultural factors, such as historical emigration patterns or a shared language, can be 
another influence on a country’s level of cooperation. Among these cultural factors 
are a country’s formal and informal institutions, such as laws, public and private 
organizations, or distinct customs and habits. Strong and stable institutions can 
make cooperation easier for state and nonstate actors. For example, countries that 
produce and export large amounts of cultural content tend to be those with legal and 
ethical institutions that protect intellectual property. Cultural exchange is one 
means through which a country may exercise soft power, which is the ability to 
influence other countries without using or threatening force. 


LOS 16.b: Describe geopolitics and its relationship with globalization. 


Globalization refers to the long-term trend toward worldwide integration of 
economic activity and cultures. Data from the World Bank suggest economic 
openness, as measured by international trade as a percentage of total output, 
increased steadily from about 25% in the early 1970s to about 60% before the 2008 
financial crisis, and has remained near that level since then. We may contrast 
globalization with nationalism—which, in this context, refers to a nation pursuing 
its own economic interests independently of, or in competition with, the economic 
interests of other countries. 


PROFESSOR’S NOTE 

Debate about what the word nationalism means is beyond the scope of 
the CFA curriculum. We use it here only in the sense of opposition or 
resistance to globalization. 


As we did with cooperation versus noncooperation, we can think of countries’ 
actions along a spectrum from globalization to nationalism. In general, countries 
that are closer to the globalization end of the spectrum are those that more actively 
import and export goods and services, permit freer movement of capital across 
borders and exchange of currencies, and are more open to cultural interaction. 


In Figure 16.1, we draw each spectrum as an axis. This creates four quadrants, each 
of which we can associate with a type of behavior by countries. While individual 
countries rarely fit neatly into one of these categories, this gives us a general 
framework within which we can describe geopolitical actions. 


Figure 16.1: Archetypes of Globalization and Cooperation 


Globalization 


Hegemony Multilateralism 
Non-cooperation <—— t » Cooperation 
Autarky Bilateralism 


Nationalism/ 
Anti-Globalization 


Source: Reproduced from Level | CFA Curriculum learning module, “Introduction to Geopolitics,” with 
permission from CFA Institute. 


Characteristics we may associate with each of these categories are as follows: 


» Autarky (noncooperation and nationalism) refers to a goal of national self- 
reliance, including producing most or all necessary goods and services 
domestically. Autarky is often associated with a state-dominated society in 
general, with attributes such as government control of industry and media. 


» Hegemony (noncooperation and globalization) refers to countries that are open 
to globalization but have the size and scale to influence other countries without 
necessarily cooperating. 


» Bilateralism (cooperation and nationalism) refers to cooperation between two 
countries. A country that engages in bilateralism may have many such 
relationships with other countries while tending not to involve itself in 
multicountry arrangements. 


» Multilateralism (cooperation and globalization) refers to countries that engage 
extensively in international trade and other forms of cooperation with many other 
countries. Some countries may exhibit regionalism, cooperating multilaterally 
with nearby countries but less so with the world at large. 


Some of the nonstate actors within a country may be more oriented toward 
globalization than their governments. Businesses may look outside their home 
country for opportunities to increase profits, reduce costs, and sell to new markets. 
Investors may seek higher returns or diversification by investing outside their home 
country. Nonstate actors might buy and sell foreign securities (portfolio 
investment flows) or own physical production capacity in other countries (foreign 
direct investment). 


LOS 16.c: Describe functions and objectives of the international organizations 
that facilitate trade, including the World Bank, the International Monetary 
Fund, and the World Trade Organization. 


Perhaps the best way to understand the roles of the organizations designed to 
facilitate trade is to examine their own statements. 


The following is according to the International Monetary Fund (IMF) (more 
available at www.IMEorg): 


Article I of the Articles of Agreement sets out the IMF’s main goals: 


» Promoting international monetary cooperation 

« Facilitating the expansion and balanced growth of international trade 
» Promoting exchange stability 

» Assisting in the establishment of a multilateral system of payments 


» Making resources available (with adequate safeguards) to members experiencing 
balance of payments difficulties 


The following is according to the World Bank (www.worldbank.org): 


The World Bank is a vital source of financial and technical assistance to developing countries 
around the world. Our mission is to fight poverty with passion and professionalism for lasting 
results and to help people help themselves and their environment by providing resources, 
sharing knowledge, building capacity, and forging partnerships in the public and private 
sectors. 


We are nota bank in the common sense; we are made up of two unique development 


institutions owned by 187 member countries: the International Bank for Reconstruction and 
Development (IBRD) and the International Development Association (IDA). 


Each institution plays a different but collaborative role in advancing the vision of inclusive and 
sustainable globalization. The IBRD aims to reduce poverty in middle-income and 
creditworthy poorer countries, while IDA focuses on the world’s poorest countries. 

[...| Together, we provide low-interest loans, interest-free credits, and grants to developing 
countries for a wide array of purposes that include investments in education, health, public 
administration, infrastructure, financial and private sector development, agriculture, and 
environmental and natural resource management. 


The following is according to the World Trade Organization (WTO) (more 
available at www.WTO.org): 


The World Trade Organization (WTO) is the only international organization dealing with the 
global rules of trade between nations. Its main function is to ensure that trade flows as 
smoothly, predictably, and freely as possible. 

[...] Trade friction is channeled into the WTO’s dispute settlement process where the focus is 
on interpreting agreements and commitments, and how to ensure that countries’ trade 
policies conform with them. That way, the risk of disputes spilling over into political or 
military conflict is reduced. 

[...] At the heart of the system—known as the multilateral trading system—are the WTO’s 
agreements, negotiated and signed by a large majority of the world’s trading nations, and 
ratified in their parliaments. These agreements are the legal ground-rules for international 
commerce. Essentially, they are contracts, guaranteeing member countries important trade 
rights. They also bind governments to keep their trade policies within agreed limits to 
everybody’s benefit. 


LOS 16.d: Describe geopolitical risk. 


Geopolitical risk is the possibility of events that interrupt peaceful international 
relations. We can classify geopolitical risk into three types: 


1. Event risk refers to events about which we know the timing but not the outcome, 
such as national elections. 


2. Exogenous risk refers to unanticipated events, such as outbreaks of war or 
rebellion. 


3. Thematic risk refers to known factors that have effects over long periods, such as 
human migration patterns or cyber risks. 


Geopolitical risk affects investment values by increasing or decreasing the risk 
premium investors require to hold assets in a country or region. To forecast the 
effect on investments of a geopolitical risk, we need to consider its probability 
(likelihood), the magnitude of its effects on investment outcomes (impact), and how 
quickly investment values would reflect these effects (velocity). 


We can use our framework of cooperation and globalization to help estimate the 
likelihood of geopolitical risk. Countries that are more cooperative and globalized 
tend to have less likelihood of some geopolitical risks, such as armed conflict, but 
may have greater likelihood of other risks, such as the supply chain disruptions that 
followed the COVID-19 pandemic in 2020 and 2021. 


To analyze the velocity of geopolitical risk, we can classify risks as high velocity 
(short term), medium velocity, or low velocity (long term). Exogenous risks often 
have high-velocity effects on financial markets and investment values. Black swan 
risk is a term for the risk of low-likelihood exogenous events that have substantial 
short-term effects. Investors with longer time horizons typically do not need to 
react to these kinds of events, but investors with shorter horizons might find it 
necessary to react. 


Medium-velocity risks can potentially damage specific companies or industries by 
increasing their costs or disrupting their production processes, while low-velocity 
risks tend to affect them in the “environmental, social, and governance” realm. 
Analyzing these kinds of risk is important for investors with long time horizons. 


LOS 16.e: Describe tools of geopolitics and their impact on regions and 
economies. 


We can consider tools of geopolitics, the means by which (primarily) state actors 
advance their interests in the world, as falling into three broad categories of national 
security, economic, and financial. 


National security tools may include armed conflict, espionage, or bilateral or 
multilateral agreements designed to reinforce or prevent armed conflict. We can say 
a national security tool is active if a country is currently using it or threatened if a 
country is not currently using it but appears likely to do so. Armed conflict affects 
regions and economies by destroying productive capital and causing migration away 
from areas of conflict. 


Economic tools can be cooperative or noncooperative. Examples of cooperative 
economic tools include free trade areas, common markets, and economic and 
monetary unions (each of which we describe in our reading on international trade 
and capital flows). Examples of noncooperative economic tools include domestic 
content requirements, voluntary export restraints, and nationalization (i.e., the state 
taking control) of companies or industries. 


Financial tools include foreign investment and the exchange of currencies. We can 
view countries as using these tools cooperatively if they allow foreign investment 
and the free exchange of currencies, or noncooperatively when they restrict these 
activities. Sanctions, or restrictions ona specific geopolitical actor’s financial 
interests, are a financial tool that state actors may use alongside national security 
tools. 


LOS 16.f: Describe the impact of geopolitical risk on investments. 


Because analyzing geopolitical risks requires effort, time, and resources, investors 
should consider whether the impact of geopolitical risk is likely to be high or low, 
and focus their analysis on risks that could have a high impact. With regard to those 
risks, investors should determine whether they are likely to have discrete impacts on 
a company or industry or broad impacts ona country, a region, or the world. 
Business cycles can affect the impact of geopolitical risk, in that these risks may 
have greater impacts on investment values when an economy is in recession than 
they would have during an expansion. 


Investors can use qualitative or quantitative scenario analysis to gauge the 
potential effects of geopolitical risks on their portfolios. To help identify geopolitical 
risks over time, investors may identify signposts, or data that can signal when the 
likelihood of an event is increasing or decreasing, such as volatility indicators in 
financial markets. 


—» MODULE QUIZ 16.1 
1. A state actor that is generally cooperative with other countries and primarily 
nationalist in pursuing its objectives is most accurately said to exhibit: 
A. autarky. 
B. hegemony. 


C. bilateralism. 


2. Which of the following tools of geopolitics is best described as a noncooperative 
economic tool? 
A. Voluntary export restraints. 
B. Regional free trade agreements. 
C. Restrictions on conversion of currencies. 


3. When investing for a long time horizon, a portfolio manager should most likely 
devote resources to analyzing: 
A. event risks. 


B. thematic risks. 
C. exogenous risks. 
4. Which international organization is primarily concerned with providing economic 
assistance to developing countries? 
A. World Bank. 
B. World Trade Organization. 
C. International Monetary Fund. 


KEY CONCEPTS 


LOS 16.a 


Geopolitics refers to interactions among nations. On various issues ranging from 
diplomacy and military force to economic or cultural openness, countries lie along a 
spectrum from cooperative to noncooperative. 


LOS 16.b 


Globalization refers to integration of economic activity and cultures among 
countries, and can be contrasted with nationalism, which refers to a country 
pursuing its own interests independently of other countries. Analysts should view 
geopolitical actions as being on a spectrum from nationalism to globalization. 


We may describe geopolitics and its relationship with globalization using the 
following four broad categories: autarky (noncooperation and nationalism), 
hegemony (noncooperation and globalization), bilateralism (cooperation and 
nationalism), and multilateralism (cooperation and globalization). 


LOS 16.c 

The International Monetary Fund facilitates trade by promoting international 
monetary cooperation and exchange rate stability, assists in setting up international 
payments systems, and makes resources available to member countries with balance 
of payments problems. 


The World Bank provides low-interest loans, interest-free credits, and grants to 
developing countries for many specific purposes. It also provides resources and 
knowledge and helps form private/public partnerships with the overall goal of 

fighting poverty. 


The World Trade Organization has the goal of ensuring that trade flows freely and 
works smoothly. Its main focus is on instituting, interpreting, and enforcing 
numerous multilateral trade agreements that detail global trade policies for a large 
majority of the world’s trading nations. 


LOS 16.d 


Categories of geopolitical risk are event risk (when the timing is known), exogenous 
risk (unanticipated events), and thematic risk (known factors that have long-term 
effects). 


LOS 16.e 
Tools of geopolitics include national security tools, economic tools, and financial 
tools. 


National security tools may include armed conflict, espionage, or bilateral or 
multilateral national security agreements. 


Cooperative economic tools include free trade areas, common markets, and 
economic and monetary unions. Noncooperative economic tools include domestic 
content requirements, voluntary export restraints, and nationalization. 


Financial tools include foreign investment, exchange of currencies, and sanctions. 


LOS 16.f 


Investors should analyze the likelihood of a geopolitical risk, the impact on 
investment values of an event if it occurs, and the velocity with which it would 


affect investment values. 


ANSWER KEY FOR MODULE QUIZZES 


Module Quiz 16.1 


1.C Bilateralism is characterized by nationalism (as contrasted with globalization) 
and cooperation. Both autarky and hegemony are characterized by 
noncooperation. (LOS 16.b) 


2.A Voluntary export restraints (exporting less of a good than the global market 
demands) are an example of a noncooperative economic tool. Restrictions on 
the exchange of currencies are a financial tool. Free trade agreements are a 
cooperative economic tool. (LOS 16.e) 


3.B Thematic risks are those that have effects over the long term. Event risks and 
exogenous risks are more likely to have high-velocity impacts on investment 
values, but they are less of a focus for investors with longer time horizons. (LOS 
16.d) 


4.A The World Bank provides technical and financial assistance to economically 
developing countries. The World Trade Organization is primarily concerned 
with settling disputes among countries concerning international trade. The 
International Monetary Fund promotes international trade and exchange rate 
stability and assists member countries that experience balance of payments 
trouble. (LOS 16.c) 


READING 17 
INTERNATIONAL TRADE 


MODULE 17.1: INTERNATIONAL TRADE Ga 


Video covering 
this content is 
available online. 


LOS 17.a: Describe the benefits and costs of international trade. 


Historically, economic models of trade have focused on the gains that 

result when countries with lower relative cost of (a comparative advantage in) the 
production of a good specialize in producing that good and export it, importing 
goods for which other countries have a lower relative cost or production. This 
increases the total output of goods and the wealth of both countries. Comparative 
advantage results from differences in technology and resource endowments across 
countries. 


Newer models of trade emphasize gains from economies of scale that reduce costs 
of export goods, an increased variety of goods produced, decreasing costs and 
improved quality from additional competition, and more efficient allocation of 
productive resources. 


Free trade can also benefit consumers by reducing the pricing power of domestic 
monopolies. Countries that produce the same good can also operate in market of 
monopolistic competition with differentiated products. Consider the global market 
for automobiles. Several countries export one type of autos and import other types. 
Consumers benefit from the greater variety offered, as well as the reduced costs 
from specialization. 


While domestic consumers of imported goods and domestic producers of exported 
goods both gain, international trade imposes costs as well. The most cited costs of 
free trade are the loss of domestic jobs in an importing industry and increased 
economic inequality. Consider a country with a comparative advantage in the 
production of steel that exports to another country with a higher relative cost of 
steel production. When free trade is permitted, consumers of steel in the importing 
country gain and workers and producers of steel lose. In the exporting country, 
increased demand for steel production may increase prices to domestic consumers, 
while workers and companies in the steel industry will lose, at least in the short run. 


In a situation where labor costs are lower in the exporting country, free trade can 
decrease wages and employment in the domestic economy. Consider textile 
production in a country with a relatively high cost of labor. For a labor-intensive 
industry such as some textile production, importing textiles from a lower-cost 


country will result in job and wage losses in the domestic industry, possibly 
increasing income inequality. 


When we speak of the gains from trade in general, it is based on economic analysis 
suggesting that, overall, the gains from trade are greater than the losses—especially 
in the long run. That is, the gainers could, at least in theory, compensate the losers, 
with net gains shared by both importing and exporting countries. While free trade 
surely imposes costs on the workers and companies in an industry facing 
competition from imports in the short run, it is argued that in the long run—after 
workers receive additional training and find work in other industries—the short- 
run costs of free trade are mitigated to some significant degree, or even reversed. 


LOS 17.b: Compare types of trade restrictions, such as tariffs, quotas, and 
export subsidies, and their economic implications. 


There are many reasons (at least stated reasons) given by governments that impose 
trade restrictions. Some have support among economists as conceivably valid in 
terms of increasing a country’s welfare, while others get little or no support from 
economic theory. Some of the reasons for trade restrictions that have support from 
economists are as follows: 


« Infant industry. Protection from foreign competition is given to new industries to 
give them an opportunity to grow to an internationally competitive scale and get 
up the learning curve in terms of efficient production methods. 


= National security. Even if imports are cheaper, it may be in the country’s best 
interest to protect producers of goods crucial to the country’s national defense so 
that those goods are available domestically in the event of conflict. 


Other arguments for trade restrictions that have little support in theory are as 
follows: 


» Protecting domestic jobs. While some jobs are certainly lost, and some groups and 
regions are negatively affected by free trade, other jobs (in export industries or 
growing domestic goods and services industries) will be created, and prices for 
domestic consumers will be less without import restrictions. 


» Protecting domestic industries. Industry firms often use political influence to get 
protection from foreign competition—usually to the detriment of consumers, who 
pay higher prices. 


Other arguments include retaliation for foreign trade restrictions, government 
collection of tariffs (like taxes on imported goods), countering the effects of 
government subsidies paid to foreign producers, and preventing foreign exports at 
less than their cost of production (dumping). 


Types of trade restrictions include the following: 


« Tariffs. These are taxes on imported goods collected by the government. 
= Quotas. These are limits on the amount of imports allowed over some period. 


» Export subsidies. These are government payments to firms that export goods. 


=» Minimum domestic content. This is the requirement that some percentage of 
product content must be from the domestic country. 


= Voluntary export restraint. A country voluntarily restricts the amount of a good 
that can be exported, often in the hope of avoiding tariffs or quotas imposed by its 
trading partners. 


Economic Implications of Trade Restrictions 


We will now examine the effects of the primary types of trade restrictions, tariffs, 
and subsidies. 


A tariff placed on an imported good increases its domestic price, decreases the 
quantity imported, and increases the quantity supplied domestically. Domestic 
producers gain, foreign exporters lose, and the domestic government gains by the 
amount of the tariff revenues. 


A quota restricts the quantity of a good imported to the quota amount. Domestic 
producers gain, and domestic consumers lose from an increase in the domestic 
price. The right to export a specific quantity to the domestic country is granted by 
the domestic government, which may or may not charge for the import licenses to 
foreign countries. If the import licenses are sold, the domestic government gains the 
revenue. 


We illustrate the overall welfare effects of quotas and tariffs for a small country in 
Figure 17.1. We define a quota that is equivalent to a given tariff as a quota that will 
result in the same decrease in the quantity of a good imported as the tariff. Defined 
this way, a tariff and an equivalent quota both increase the domestic price from 
Pworla the price that prevails with no trade restriction, to Protection: 


At Pong before any restriction, the domestic quantity supplied is QS,, and the 
domestic quantity demanded is QD,, with the difference equal to the quantity 
imported, QD, — QS). Placing a tariff on imports increases the domestic price to 
Pprotectiow increases the domestic quantity supplied to QS2, and decreases the 
domestic quantity demanded to QD>. The difference is the new quantity imported. 
An equivalent quota will have the same effect, decreasing the quantity imported to 


QD2 — QSp. 


The entire shaded area in Figure 17.1 represents the loss of consumer surplus in the 
domestic economy. The portion with vertical lines, the area to the left of the 
domestic supply curve between Ppyotection ANd Pworg represents the gain in the 
producer surplus of domestic producers. The portion with horizontal lines, the area 
bounded by QD - QS2 and Porotection ~ Pwonw represents the gain to the domestic 
government from tariff revenue. The two remaining triangular areas are the 
deadweight loss from the restriction on free trade. 


Figure 17.1: Effects of Tariffs and Quotas 
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In the case of a quota, if the domestic government collects the full value of the 
import licenses, the result is the same as for a tariff. If the domestic government 
does not charge for the import licenses, this amount is a gain to those foreign 
exporters who receive the import licenses under the quota and are termed quota 
rents. 


In terms of overall economic gains from trade, the deadweight loss is the amount of 
lost welfare from the imposition of a quota or tariff. From the viewpoint of the 
domestic country, the loss in consumer surplus is only partially offset by the gains in 
domestic producer surplus and the collection of tariff revenue. 


If none of the quota rents are captured by the domestic government, the overall 
welfare loss to the domestic economy is greater by the amount of the quota rents. It 
is the entire difference between the gain in producer surplus and the loss of 
consumer surplus. 


A voluntary export restraint (VER) is just as it sounds. It refers to a voluntary 
agreement by a government to limit the quantity of a good that can be exported. 
VERs are another way of protecting the domestic producers in the importing 
country. They result in a welfare loss to the importing country equal to that of an 
equivalent quota with no government charge for the import licenses—that is, no 
capture of the quota rents. 


Export subsidies are payments by a government to its country’s exporters. Export 
subsidies benefit producers (exporters) of the good but increase prices and reduce 
consumer surplus in the exporting country. In a small country, the price will 
increase by the amount of the subsidy to equal the world price plus the subsidy. In 
the case of a large exporter of the good, the world price decreases and some benefits 
from the subsidy accrue to foreign consumers, while foreign producers are 
negatively affected. 


Most of the effects of all four of these protectionist policies are the same. With 
respect to the domestic (importing) country, import quotas, tariffs, and VERs all do 
the following: 


= Reduce imports 


= Increase price 


» Decrease consumer surplus 
= Increase domestic quantity supplied 


= Increase producer surplus 


With one exception, all will decrease national welfare. Quotas and tariffs in a large 
country could increase national welfare under a specific set of assumptions, 
primarily because for a country that imports a large amount of the good, setting a 
quota or tariff could reduce the world price for the good. 


Capital Restrictions 


Some countries impose capital restrictions on the flow of financial capital across 
borders. Restrictions include outright prohibition of investment in the domestic 
country by foreigners, prohibition of or taxes on the income earned on foreign 
investments by domestic citizens, prohibition of foreign investments in certain 
domestic industries, and restrictions on repatriation of earnings of foreign entities 
operating in a country. 


Overall, capital restrictions are thought to decrease economic welfare. However, 
over the short term, they have helped developing countries avoid the impact of great 
inflows of foreign capital during periods of optimistic expansion and the impact of 
large outflows of foreign capital during periods of correction and market unease, or 
outright panic. Even these short-term benefits may not offset longer-term costs if 
the country is excluded from international markets for financial capital flows. 


LOS 17.c: Explain motivations for and advantages of trading blocs, common 
markets, and economic unions. 


There are various types of agreements among countries with respect to trade policy. 
The essence of all of them is to reduce trade barriers among the countries. 
Reductions in trade restrictions among countries have some, by now familiar, 
positive and negative effects on economic welfare. The positive effects result from 
increased trade according to comparative advantage, as well as increased 
competition among firms in member countries. The negative effects result because 
some firms, some industries, and some groups of workers will see their wealth and 
incomes decrease. Workers in affected industries may need to learn new skills to get 
new jobs. 


On balance, economic welfare is improved by reducing or eliminating trade 
restrictions. Note, however, that to the extent that a trade agreement increases trade 
restrictions on imports from nonmember countries, economic welfare gains are 
reduced and, in an extreme case, could be outweighed by the costs such restrictions 
impose. This could result if restrictions on trade with nonmember countries 
increase a country’s (unrestricted) imports from a member that has higher prices 
than the country’s previous imports from a nonmember. 


We list these types of agreements, generally referred to as trading blocs or 
regional trading agreements (RTAs), in order of their degrees of integration. 


Free Trade Areas 


1. All barriers to import and export of goods and services among member countries 
are removed. 


Customs Union 


1. All barriers to import and export of goods and services among member countries 
are removed. 


2. All countries adopt a common set of trade restrictions with nonmembers. 


Common Market 


1. All barriers to import and export of goods and services among the countries are 
removed. 


2. All countries adopt a common set of trade restrictions with nonmembers. 
3. All barriers to the movement of labor and capital goods among member countries 
are removed. 


Economic Union 


1. All barriers to import and export of goods and services among the countries are 
removed. 


2. All countries adopt a common set of trade restrictions with nonmembers. 


3. All barriers to the movement of labor and capital goods among member countries 
are removed. 


4. Member countries establish common institutions and economic policy for the 
union. 


Monetary Union 


1. All barriers to import and export of goods and services among the countries are 
removed. 


2. All countries adopt a common set of trade restrictions with nonmembers. 


3. All barriers to the movement of labor and capital goods among member countries 
are removed. 


4. Member countries establish common institutions and economic policy for the 
union. 


5. Member countries adopt a single currency. 
The North American Free Trade Agreement (NAFTA) is an example of a free trade 


area, the European Union (EU) is an example of an economic union, and the 
Eurozone is an example of a monetary union. 


=>» MODULE QUIZ 17.1 


1. Which of the following effects is most likely to occur in a country that increases its 
openness to international trade? 
A. Increased prices of consumer goods. 
B. Greater specialization in domestic output. 
C. Decreased employment in exporting industries. 


2. An agreement with another country to limit the volume of goods and services sold 
to them is best described as a: 


A. quota. 
B. voluntary export restraint. 
C. minimum domestic content rule. 
3. Which of the following groups would be most likely to suffer losses from the 
imposition of a tariff on steel imports? 
A. Domestic steel producers. 
B. Workers in the domestic auto industry. 
C. Workers in the domestic steel industry. 
4. The most likely motivation for establishing a trading bloc is to: 
A. increase economic welfare in the member countries. 
B. increase tariff revenue for the member governments. 
C. protect domestic industries in the member economies. 


5. In which type of regional trade agreement are economic policies conducted 
independently by the member countries, while labor and capital are free to move 
among member countries? 

A. Free trade area. 
B. Common market. 
C. Economic union. 


KEY CONCEPTS 


LOS 17.a 

Free trade among countries increases overall economic welfare. Countries can 
benefit from trade because one country can specialize in the production of an export 
good and benefit from economies of scale. Economic welfare can also be increased 
by greater product variety, more competition, and a more efficient allocation of 
resources. 


Costs of free trade are primarily losses to those in domestic industries that lose 
business to foreign competition, especially less efficient producers who leave an 
industry. While other domestic industries will benefit from freer trade policies, 
unemployment may increase over the period in which workers are retrained for 
jobs in the expanding industries. Some argue that greater income inequality may 
result, but overall, the gains from liberalization of trade policies are thought to 
exceed the costs, so that the winners could conceivably compensate the losers and 
still be better off. 


LOS 17.b 
Types of trade restrictions include the following: 


« Tariffs. These are taxes on imported goods collected by the government. 
= Quotas. These are limits on the amount of imports allowed over some period. 


« Minimum domestic content. This is the requirement that some percentage of 
product content must be from the domestic country. 


« Voluntary export restraints. A country voluntarily restricts the amount of a good 
that can be exported, often in the hope of avoiding tariffs or quotas imposed by 
their trading partners. 


Within each importing country, all of these restrictions will tend to do the 
following: 


» Increase prices of imports and decrease quantities of imports 


» Increase demand for and quantity supplied of domestically produced goods 


» Increase producer’s surplus and decrease consumer surplus 


Export subsidies decrease export prices and benefit importing countries at the 
expense of the government of the exporting country. 


Restrictions on the flow of financial capital across borders include outright 
prohibition of investment in the domestic country by foreigners, prohibition of or 
taxes on the income earned on foreign investments by domestic citizens, prohibition 
of foreign investments in certain domestic industries, and restrictions on 
repatriation of earnings of foreign entities operating in a country. 


LOS 17.c 


Trade agreements, which increase economic welfare by facilitating trade among 
member countries, take the following forms: 


Free trade area. All barriers to the import and export of goods and services among 
member countries are removed. 


Customs union. Member countries also adopt a common set of trade restrictions 
with nonmembers. 


Common market. Member countries also remove all barriers to the movement of 
labor and capital goods among members. 

Economic union. Member countries also establish common institutions and 
economic policy for the union. 


Monetary union. Member countries also adopt a single currency to reduce the cost 
of cross-border trade. 


ANSWER KEY FOR MODULE QUIZZES 


Module Quiz 17.1 


1.B Openness to international trade increases specialization as production shifts to 


those products in which domestic producers have a comparative advantage. 
Greater competition from imports will tend to decrease prices for consumer 
goods. Increasing international trade is likely to increase profitability and 
employment in exporting industries, but it may decrease profitability and 
employment in industries that compete with imported goods. (LOS 17.a) 


2.B Voluntary export restraints are agreements to limit the volume of goods and 


services exported to another country. Minimum domestic content rules are 
limitations imposed by a government on its domestic firms. Import quotas are 
limitations on imports, not on exports. (LOS 17.b) 


3.B Imposing a tariff on steel imports benefits domestic steel producers and 


workers by increasing the domestic price of steel and benefits the national 
government by increasing tax (tariff) revenue. However, the increase in the 
domestic price of steel would increase costs in industries that use significant 
amounts of steel, such as the automobile industry. The resulting increase in the 


price of automobiles reduces the quantity of automobiles demanded, and it 
ultimately reduces employment in that industry. (LOS 17.b) 


4.A The motivation for trading blocs is to increase economic welfare in the member 
countries by eliminating barriers to trade. Joining a trading bloc may have 
negative consequences for some domestic industries and may decrease tariff 
revenue for the government. (LOS 17.c) 


5.B These characteristics describe a common market. In a free trade area, member 
countries remove restrictions on goods and services and trade with one 
another but may still restrict movement of labor and capital among member 
countries. In an economic union, member countries also coordinate their 
economic policies and institutions. (LOS 17.c) 


READING 18 


CAPITAL FLOWS AND THE FX 
MARKET 


MODULE 18.1: THE FOREIGN EXCHANGE A 
MARKET 


Video covering 
this content is 


LOS 18.a: Describe the foreign exchange market, including its = 2vailable online. 
functions and participants, distinguish between nominal and 

real exchange rates, and calculate and interpret the percentage change ina 
currency relative to another currency. 


Foreign currency markets serve companies and individuals that purchase or sell 
foreign goods and services denominated in foreign currencies. An even larger 
market, however, exists for capital flows. Foreign currencies are needed to purchase 
foreign physical assets as well as foreign financial securities. 


Many companies have foreign exchange risk arising from their cross-border 
transactions. A Japanese company that expects to receive 10 million euros when a 
transaction is completed in 90 days has yen/euro exchange rate risk as a result. By 
entering into a forward currency contract to sell 10 million euros in 90 days for a 
specific quantity of yen, the firm can reduce or eliminate the foreign exchange risk 
associated with the transaction. When a firm takes a position in the foreign exchange 
market to reduce an existing risk, we say the firm is hedging its risk. 


Alternatively, when a transaction in the foreign exchange markets increases 
currency risk, we describe the transaction or position as speculating. Investors, 
companies, and financial institutions, such as banks and investment funds, all 
regularly enter into speculative foreign currency transactions. 


The primary dealers in foreign exchange (FX) and originators of forward FX 
contracts are large multinational banks. This part of the FX market is often called 
the sell side. On the other hand, the buy side consists of the many buyers of foreign 
currencies and forward FX contracts. These buyers include the following: 


« Corporations regularly engage in cross-border transactions, purchase and sell 
foreign currencies as a result, and enter into FX forward contracts to hedge the 
risk of expected future receipts and payments denominated in foreign currencies. 


« Investment accounts of many types transact in foreign currencies, hold foreign 


securities, and may both speculate and hedge with currency derivatives. Real 
money accounts refer to mutual funds, pension funds, insurance companies, and 


other institutional accounts that do not use derivatives. Leveraged accounts 
refer to the various types of investment firms that use derivatives, including hedge 
funds, firms that trade for their own accounts, and other trading firms of various 
types. 

= Governments and government entities, including sovereign wealth funds and 
pension funds, acquire foreign exchange for transactional needs, investment, or 
speculation. Central banks sometimes engage in FX transactions to affect exchange 
rates in the short term in accordance with government policy. 


« The retail FX market refers to FX transactions by households and relatively small 
institutions, and it may be for tourism, cross-border investment, or speculative 
trading. 


Types of Exchange Rates 


An exchange rate is simply the price or cost of units of one currency in terms of 
another. In this book we state exchange rates in the form 1.416 USD/EUR, to mean 
that each euro costs $1.416. If you read the “/” as per, you will have no trouble with 
the notation. We say the exchange rate is $1.416 per euro. 


PROFESSOR’S NOTE 

This is the way exchange rates are quoted in the Level I CFA curriculum. 
Foreign exchange traders typically use the inverse (an indirect 
quotation). 


In a foreign currency quotation, we have the price of one currency in units of 
another currency. These are often referred to as the base currency and the price 
currency. In the quotation 1.25 USD/EUR, the USD is the price currency and the 
EUR is the base currency. The price of one euro (base currency) is 1.25 USD (the 
price currency), so 1.25 is the price of one unit of the base currency in terms of the 
other. It may help to remember that the euro, in this example, is in the bottom or 
“base” of the exchange rate given in terms of USD/EUR. 


An exchange rate expressed as price currency/base currency is referred to as a 
direct quote from the point of view of an investor in the price currency country 
and an indirect quote from the point of view of an investor in the base currency 
country. For example, a quote of 1.17 USD/EUR would be a direct quote for a USD- 
based investor and an indirect quote for a EUR-based investor. Conversely, a quote of 
1 / 1.17 = 0.855 EUR/USD would be a direct quote for a EUR-based investor and an 
indirect quote for a USD-based investor. 


The exchange rate at a point in time is referred to as a nominal exchange rate. If 
this rate (price/base) increases, the cost of a unit of the base currency in terms of 
the price currency has increased, so that the purchasing power of the price currency 
has decreased. If the USD/EUR exchange rate increases from 1.10 to 1.15, the cost of 
100 euros increases from $110 to $115. The purchasing power of the dollar has 
decreased relative to the euro because the cost of 100 euros’ worth of goods to a 
consumer in the United States has increased over the period. 


The purchasing power of one currency relative to another is also affected by changes 
in the price levels of the two countries. The real exchange rate between two 


currencies refers to the purchasing power of one currency in terms of the amount of 
goods priced in another currency, relative to an earlier (base) period. 


Consider a situation in which the nominal USD/EUR exchange rate is unchanged at 
1.00 over a period and the price level in the United States is unchanged, while prices 
in the Eurozone have increased by 5%. Eurozone goods that cost 100 euros at the 
beginning of the period cost 105 euros at the end of the period. With the nominal 
exchange rate unchanged, the purchasing power of the USD in the Eurozone has 
decreased, because exchanging 100 USD for 100 EUR will now buy only 100 / 105 = 
95.2% of the goods 100 EUR could buy at the beginning of the period. 


Here is a summary: 


» Anincrease in the nominal USD/EUR rate decreases the purchasing power of the 
USD in the Eurozone (and increases the purchasing power of the EUR in the United 
States); the real USD/EUR exchange rate has increased. 


» A decrease in the nominal USD/EUR rate increases the purchasing power of the 
USD in the Eurozone (and decreases the purchasing power of the EUR in the 
United States); the real USD/EUR exchange rate has decreased. 


» An increase in the Eurozone price level, relative to the price level in the United 
States, will increase the real USD/EUR exchange rate, decreasing the purchasing 
power of the USD in the Eurozone (and increasing the purchasing power of the 
EUR in the United States). 


= A decrease in the Eurozone price level, relative to the price level in the United 
States, will decrease the real USD/EUR exchange rate, increasing the purchasing 
power of the USD in the Eurozone (and decreasing the purchasing power of the 
EUR in the United States). 


The end-of-period real P/B exchange rate can be calculated as follows: 


CP lyase currency 


real P/B exchange rate = nominal P/B exchange rate x ( CPI 


price currency, 
where the CPI values are relative to base period values of 100. 
We can see from the formula the following: 


» An increase (decrease) in the nominal exchange rate over the period increases 
(decreases) the end-of period real exchange rate, and the purchasing power of the 
price currency decreases (increases). 


» An increase in the price level in the price currency country relative to the price 
level in the base currency country will decrease the real exchange rate, increasing 
the purchasing power of the price currency in terms of base country goods. 

= Conversely, a decrease in the price level in the price currency country relative to 
the price level in the base currency country will increase the real exchange rate, 
decreasing the purchasing power of the price currency in terms of base country 
goods. 


In the following example, we calculate the end-of-period real $/£ exchange rate 
when the nominal $/£ exchange rate has decreased over the period (which tends to 
decrease the real exchange rate and increase the purchasing power of the price 
currency )—and when the price level in the United Kingdom has increased by more 


than the price level in the United States over the period (which tends to increase the 
real exchange rate and decrease the purchasing power of the price currency). The 
relative increase in U.K. prices has reduced the effects of the decrease in the nominal 
exchange rate on the increase in the purchasing power of the USD. 


EXAMPLE: Real exchange rate 


At a base period, the CPIs of the United States and United Kingdom are both 100, 
and the exchange rate is $1.70/£. Three years later, the exchange rate is $1.60/£, 
and the CPI has risen to 110 in the United States and 112 in the United Kingdom. 
What is the real exchange rate at the end of the three-year period? 


Answer: 


The real exchange rate is $1.60 / £ x 112 / 110 = $1.629 / £, which means that U.S. 
goods and services that cost $1.70 at the base period now cost only $1.629 (in 
real terms) if purchased in the United Kingdom and the real exchange rate, $/£, 
has fallen. The decrease in the real exchange rate (and the increase in the 
purchasing power of the USD in terms of U.K. goods) over the period is less than it 
would have been if the relative prices between the two countries had not 
changed. 


A spot exchange rate is the currency exchange rate for immediate delivery, which 
for most currencies means the exchange of currencies takes place two days after the 
trade. 


A forward exchange rate is a currency exchange rate for an exchange to be done in 
the future. Forward rates are quoted for various future dates (e.g., 30 days, 60 days, 
90 days, or one year). A forward is actually an agreement to exchange a specific 
amount of one currency for a specific amount of another on a future date specified 
in the forward agreement. 


A French firm that will receive 10 million GBP from a British firm six months from 
now has uncertainty about the amount of euros that payment will be equivalent to 
six months from now. By entering into a forward agreement covering 10 million GBP 
at the 6-month forward rate of 1.192 EUR/GBP, the French firm has agreed to 
exchange 10 million GBP for 11.92 million euros in six months. 


Calculating the Percentage Change in the FX Value 
of a Currency 


Consider a USD/EUR exchange rate that has changed from 1.42 to 1.39 USD/EUR. 
The percentage change in the dollar price of a euro is simply 1.39 / 1.42-1= 
-0.0211 = —-2.11%. Because the dollar price of a euro has fallen, the euro has 
depreciated relative to the dollar, and a euro now buys 2.11% fewer U.S. dollars. It is 
correct to say that the euro has depreciated by 2.11% relative to the dollar. 


On the other hand, it is not correct to say that the dollar has appreciated by 2.11%. 
To calculate the percentage appreciation of the dollar, we need to convert the quotes 
to EUR/USD. So, our beginning quote of 1.42 USD/EUR becomes 1 / 1.42 = 0.7042 
EUR/USD, and our ending quote of 1.39 USD/EUR becomes 1 / 1.39 = 0.7194 
EUR/USD. Using these exchange rates, we can calculate the change in the euro price 


of a dollar as 0.7194 / 0.7042 — 1 = 0.0216 = 2.16%. In this case, it is correct to say 
that the dollar has appreciated 2.16% with respect to the euro. Note that for the 
given exchange rate quotes, the percentage appreciation of the dollar is not the same 
as the percentage depreciation in the euro. 


=» MODULE QUIZ 18.1 
=; 
1. One year ago, the nominal exchange rate for USD/EUR was 1.300. Since then, the 
real exchange rate has increased by 3%. This most likely implies that: 
A. the nominal exchange rate is less than USD/EUR 1.235. 
B. the purchasing power of the euro has increased approximately 3% in terms of 
U.S. goods. 
C. inflation in the euro zone was approximately 3% higher than inflation in the 


United States. 
2. Sell-side participants in the foreign exchange market are most likely to include: 
A. banks. 
B. hedge funds. 
C. insurance companies. 


3. Suppose that the quote for British pounds (GBP) in New York is USD/GBP 1.3110. 
What is the quote for U.S. dollars (USD) in London (GBP/USD)? 
A. 0.3110. 
B. 0.7628. 
C. 1.3110. 
4. The Canadian dollar (CAD) exchange rate with the Japanese yen (JPY) changes 
from JPY/CAD 75 to JPY/CAD 78. The CAD has: 
A. depreciated by 3.8%, and the JPY has appreciated by 4.0%. 
B. appreciated by 3.8%, and the JPY has depreciated by 4.0%. 
C. appreciated by 4.0%, and the JPY has depreciated by 3.8%. 


MODULE 18.2: MANAGING EXCHANGE C) 
RATES 


Video covering 
this content is 
available online. 
LOS 18.b: Describe exchange rate regimes and explain the 


effects of exchange rates on countries’ international trade and 
capital flows. 


The IMF categorizes exchange rate regimes into the following types, which 
include two for countries that do not issue their own currencies and seven for 
countries that issue their own currencies. 


Countries That Do Not Have Their Own Currency 


A country can use the currency of another country (formal dollarization). The 
country cannot have its own monetary policy, as it does not create money or issue 
currency. 


A country can be a member of a monetary union in which several countries use a 
common currency. Within the European Union, for example, most countries use the 
euro. While individual countries give up the ability to set domestic monetary policy, 


they all participate in determining the monetary policy of the European Central 
Bank. 


Countries That Have Their Own Currency 


A currency board arrangement is an explicit commitment to exchange domestic 
currency for a specified foreign currency at a fixed exchange rate. A notable example 
of such an arrangement is Hong Kong. In Hong Kong, currency is (and may be) only 
issued when fully backed by holdings of an equivalent amount of U.S. dollars. The 
Hong Kong Monetary Authority can earn interest on its U.S. dollar balances. With 
dollarization, there is no such income, as the income is earned by the U.S. Federal 
Reserve when it buys interest-bearing assets with the U.S. currency it issues. While 
the monetary authority gives up the ability to conduct independent monetary policy 
and essentially imports the inflation rate of the outside currency, there may be some 
latitude to affect interest rates over the short term. 


In a conventional fixed peg arrangement, a country pegs its currency within 
margins of +1% versus another currency or a basket that includes the currencies of 
its major trading or financial partners. The monetary authority can maintain 
exchange rates within the band by purchasing or selling foreign currencies in the 
foreign exchange markets (direct intervention). In addition, the country can use 
indirect intervention, including changes in interest rate policy, regulation of foreign 
exchange transactions, and convincing people to constrain foreign exchange activity. 
The monetary authority retains more flexibility to conduct monetary policy than 
with dollarization, a monetary union, or a currency board. However, changes in 
policy are constrained by the requirements of the peg. 


In a system of pegged exchange rates within horizontal bands or a target zone, the 
permitted fluctuations in currency value relative to another currency or basket of 
currencies are wider (e.g., +2%). Compared to a conventional peg, the monetary 
authority has more policy discretion because the bands are wider. 


With a crawling peg, the exchange rate is adjusted periodically, typically to adjust 
for higher inflation versus the currency used in the peg. This is termed a passive 
crawling peg as opposed to an active crawling peg, in which a series of exchange rate 
adjustments over time is announced and implemented. An active crawling peg can 
influence inflation expectations, adding some predictability to domestic inflation. 
Monetary policy is restricted in much the same way it is with a fixed peg 
arrangement. 


With management of exchange rates within crawling bands, the width of the 
bands that identify permissible exchange rates is increased over time. This method 
can be used to transition from a fixed peg to a floating rate when the monetary 
authority’s lack of credibility makes an immediate change to floating rates 
impractical. Again, the degree of monetary policy flexibility increases with the 
width of the bands. 


With a system of managed floating exchange rates, the monetary authority 
attempts to influence the exchange rate in response to specific indicators such as the 
balance of payments, inflation rates, or employment without any specific target 
exchange rate or predetermined exchange rate path. Intervention may be direct or 


indirect. Such management of exchange rates may induce trading partners to 
respond in ways that reduce stability. 


When a currency is independently floating, the exchange rate is market 
determined, and foreign exchange market intervention is used only to slow the rate 
of change and reduce short-term fluctuations—not to keep exchange rates at a 
target level. 


Changes in Exchange Rates 


Changes in exchange rates can affect both imports and exports. If the USD/EUR 
exchange rate decreases, the USD has appreciated relative to the EUR. In this case, 
the USD prices of imports from the Eurozone have decreased, and it takes more 
euros to purchase a U.S. good at the new exchange rate. As a result, Eurozone 
imports from the United States (U.S. exports to the Eurozone) will decrease, and U.S. 
imports of Eurozone goods (Eurozone exports to the United States) will increase. 


The effects on the goods market (imports and exports of merchandise) occur more 
slowly than the effects on capital flows between countries. This is reflected ina 
country’s balance of payments. Essentially, the balance of payments refers to the 
fact that capital flows must offset any imbalance between the value of a country’s 
exports to and imports from another country. 


Consider two countries that have an imbalance between the value of their exports to 
and imports from another country, such as between China and the United States. 
Typically, U.S. imports from China have a greater value than U.S. exports to China. We 
say that the United States has a goods and services trade deficit with China and 
China has a trade surplus with the United States. Capital flows must offset this 
difference. 


Capital flows result primarily from purchases of assets (both physical and financial) 
in one country by investors or governments in other countries. A large portion of 
the capital flows between the United States and China are purchases of U.S. debt 
securities by China. China has a capital account deficit that offsets its trade surplus 
with the United States. 


The relation between the balance of trade and capital flows is expressed by the 
following identity, which was presented in the Economics prerequisite readings: 


(exports — imports) = (private savings — investment in physical capital) 
+ (tax revenue — government spending) 


or: 
(X — M)=(S—-1)+(T- G) 


The intuition is that a trade deficit (X - M < 0) means that the right-hand side must 
also be negative, which implies that total domestic savings (private savings + 
government savings) are less than domestic investment in physical capital. The 
additional amount to fund domestic investment must come from foreigners, which 
results in a surplus in the capital account to offset the deficit in the trade account. 
Another thing we can see from this identity is that any government deficit not 
funded by an excess of domestic saving over domestic investment is consistent with 


a trade deficit (imports > exports), which is offset by an inflow of foreign capital (a 
surplus in the capital account). 


Capital flows adjust more rapidly than spending, savings, and asset prices, so they 

are the primary determinant of exchange rates in the short and intermediate term. 
Trade flows are more important in determining exchange rates in the long term as 
asset prices and saving/investment decisions adjust over time. 


LOS 18.c: Describe common objectives of capital restrictions imposed by 
governments. 


Governments sometimes place restrictions on the flow of investment capital into 
their country, out of their country, or both. Commonly cited objectives of capital 
flow restrictions include the following: 


= Reduce the volatility of domestic asset prices. In times of macroeconomic crisis, 
capital flows out of the country can drive down asset prices drastically, especially 
with prices of liquid assets such as stocks and bonds. With no restrictions on 
inflows or outflows of foreign investment capital, the asset markets of countries 
with economies that are small relative to the amount of foreign investment can be 
quite volatile over a country’s economic cycle. 


« Maintain fixed exchange rates. For countries with fixed exchange rate targets, 
limiting flows of foreign investment capital makes it easier to meet the exchange 
rate target—and, therefore, to be able to use monetary and fiscal policy to pursue 
only the economic goals for the domestic economy. 


= Keep domestic interest rates low. By restricting the outflow of investment capital, 
countries can keep their domestic interest rates low and manage the domestic 
economy with monetary policy, as investors cannot pursue higher rates in foreign 
countries. China is an example of a country with a fixed exchange rate regime 
where restrictions on capital flows allow policymakers to maintain the target 
exchange rate as well as to pursue a monetary policy independent of concerns 
about its effect on currency exchange rates. 


« Protect strategic industries. Governments sometimes prohibit investment by 
foreign entities in industries considered to be important for national security, such 
as the telecommunications and defense industries. 


“<—) MODULE QUIZ 18.2 
——_ 
1. The monetary authority of the Stoddard Islands will exchange its currency for 

U.S. dollars at a one-for-one ratio. As a result, the exchange rate of the Stoddard 
Islands currency with the U.S. dollar is 1.00, and many businesses in the Islands 
will accept U.S. dollars in transactions. This exchange rate regime is best 
described as: 
A. a fixed peg. 
B. dollarization. 
C. a currency board. 


2. Other things equal, which of the following is most likely to decrease a country’s 
trade deficit? 
A. Increase the capital account surplus. 
B. Decrease expenditures relative to income. 


C. Decrease domestic saving relative to domestic investment. 


3. The goal of a government that imposes restrictions on foreign capital flows is most 
likely to: 
A. stimulate domestic interest rates. 
B. decrease domestic asset price volatility. 
C. encourage competition with domestic industries. 


KEY CONCEPTS 


LOS 18.a 


Currency exchange rates are given as the price of one unit of currency in terms of 
another. A nominal exchange rate of 1.44 USD/EUR is interpreted as $1.44 per euro. 
We refer to the USD as the price currency and the EUR as the base currency. 


An increase (decrease) in an exchange rate represents an appreciation (depreciation) 
of the base currency relative to the price currency. 


A spot exchange rate is the rate for immediate delivery. A forward exchange rate is a 
rate for exchange of currencies at some future date. 


A real exchange rate measures changes in relative purchasing power over time: 


CPI : 
real exchange rate = nominal exchange rate x jes | 
’ price currency 


For a change in an exchange rate, we can calculate the percentage appreciation 
(price goes up) or depreciation (price goes down) of the base currency. For example, 
a decrease in the USD/EUR exchange rate from 1.44 to 1.42 represents a 
depreciation of the EUR relative to the USD of 1.39% (1.42 / 1.44 -1=-0.0139) 
because the value of a euro has fallen 1.39%. 


Given the depreciation of the EUR (base currency) of 1.39%, we can calculate the 
appreciation of the price currency as em. = 0.0141 = 1.41%, 


The market for foreign exchange is the largest financial market in terms of the value 
of daily transactions and has various participants, including large multinational 
banks (the sell side) and corporations, investment fund managers, hedge fund 
managers, investors, governments, and central banks (the buy side). 


Participants in the foreign exchange markets are referred to as hedgers if they enter 
into transactions that decrease an existing foreign exchange risk, and as speculators 
if they enter into transactions that increase their foreign exchange risk. 


LOS 18.b 
Exchange rate regimes for countries that do not have their own currency: 


» With formal dollarization, a country uses the currency of another country. 


«» Inamonetary union, several countries use a common currency. 
Exchange rate regimes for countries that have their own currency: 


» Acurrency board arrangement is an explicit commitment to exchange domestic 
currency for a specified foreign currency at a fixed exchange rate. 


In a conventional fixed peg arrangement, a country pegs its currency within 
margins of +1% versus another currency. 


In a system of pegged exchange rates within horizontal bands or a target zone, the 

permitted fluctuations in currency value relative to another currency or basket of 
currencies are wider (e.g., +2%). 

With a crawling peg, the exchange rate is adjusted periodically, typically to adjust 

for higher inflation versus the currency used in the peg. 


With management of exchange rates within crawling bands, the width of the bands 
that identify permissible exchange rates is increased over time. 


With a system of managed floating exchange rates, the monetary authority 
attempts to influence the exchange rate in response to specific indicators, such as 
the balance of payments, inflation rates, or employment, without any specific 
target exchange rate. 


When a currency is independently floating, the exchange rate is market 
determined. 


LOS 18.c 


Commonly cited objectives of capital flow restrictions include the following: 


Reducing the volatility of domestic asset prices 
Maintaining fixed exchange rates 


Keeping domestic interest rates low and enabling greater independence regarding 
monetary policy 


Protecting strategic industries from foreign ownership 


ANSWER KEY FOR MODULE QUIZZES 


Module Quiz 18.1 


1.B An increase in the real exchange rate USD/EUR (the number of USD per one 


EUR) means that a euro is worth more in purchasing power (real) terms in the 
United States. Changes in a real exchange rate depend on the change in the 
nominal exchange rate relative to the difference in inflation. By itself, a real 
exchange rate does not indicate the directions or degrees of change in either 
the nominal exchange rate or the inflation difference. (LOS 18.a) 


2.A Large multinational banks make up the sell side of the foreign exchange market. 


The buy side includes corporations, real money and leveraged investment 
accounts, governments and government entities, and retail purchasers of 
foreign currencies. (LOS 18.a) 


3.B 1/1.311 = 0.7628 GBP/USD 


(LOS 18.a) 


4.C The CAD has appreciated because it is worth a larger number of JPY. The 


percentage appreciation is (78 - 75) / 75 = 4.0%. To calculate the percentage 
depreciation of the JPY against the CAD, convert the exchange rates to make 


JPY the base currency: 1 / 75 = 0.0133 CAD/JPY and 1 / 78 = 0.0128 CAD/JPY. 
Percentage depreciation = (0.0128 — 0.0133) / 0.0133 = -3.8%. (LOS 18.a) 


Module Quiz 18.2 


1.C This exchange rate regime is a currency board arrangement. The country has 
not formally dollarized because it continues to issue a domestic currency. A 
conventional fixed peg allows for a small degree of fluctuation around the 
target exchange rate. (LOS 18.b) 


2.B An improvement in a trade deficit requires that domestic saving increase 
relative to domestic investment, which would decrease a capital account 
surplus. Decreasing expenditures relative to income means domestic savings 
increase. Decreasing domestic saving relative to domestic investment is 
consistent with a larger capital account surplus (an increase in net foreign 
borrowing) and a greater trade deficit. (LOS 18.b) 


3.B Decreasing the volatility of domestic asset prices may be a goal of a 
government that imposes capital restrictions. Other typical goals include 


keeping domestic interest rates low and protecting certain domestic industries, 
such as the defense industry. (LOS 18.c) 


READING 19 
EXCHANGE RATE CALCULATIONS 


MODULE 19.1: FOREIGN EXCHANGE G 
RATES 


Video covering 
this content is 


LOS 19.a: Calculate and interpret currency cross-rates. available online. 


A cross rate is the exchange rate between two currencies implied by their exchange 
rates with a common third currency. Cross-rates are necessary when there is no 
active FX market in a currency pair. The rate must be computed from the exchange 
rates between each of these two currencies and a third currency, usually the USD or 
EUR. 


Let’s assume that we have the following quotations for Mexican pesos and 
Australian dollars: MXN/USD = 10.70 and USD/AUD = 0.60. The cross rate between 
Australian dollars and pesos (MXN /AUD) is as follows: 


MXN/AUD = USD/AUD x MXN/USD = 0.60 x 10.70 = 6.42 


So, our MXN/AUD cross rate is 6.42 pesos per Australian dollar. The key to 
calculating cross-rates is to note that the basis of the quotations must be such that 
we get the desired result algebraically. If we had started with an AUD/USD quotation 
of 1.67, we would have taken the inverse to get the quotation into USD/AUD terms. 
Another approach is to divide through, as is illustrated in the following example. 


EXAMPLE: Cross rate calculation 


The spot exchange rate between the Swiss franc (CHF) and the USD is CHF/USD = 
1.7799, and the spot exchange rate between the New Zealand dollar (NZD) and the 
US. dollar is NZD/USD = 2.2529. Calculate the CHF/NZD spot rate. 


Answer: 
The CHF/NZD cross rate is (CHF/USD) / (NZD/USD) = 1.7799 / 2.2529 = 0.7900. 


LOS 19.b: Explain the arbitrage relationship between spot and forward 
exchange rates and interest rates, calculate a forward rate using points or in 
percentage terms, and interpret a forward discount or premium. 


When currencies are freely traded and forward currency contracts exist, the 
percentage difference between forward and spot exchange rates is approximately 


equal to the difference between the two countries’ interest rates. This is because 
there is an arbitrage trade with a riskless profit to be made when this relation does 
not hold. 


We call this a no-arbitrage condition because if it doesn’t hold, there is an 
opportunity to make a profit without risk. The possible arbitrage is as follows: 
borrow Currency A at Interest Rate A, convert it to Currency B at the spot rate and 
invest it to earn Interest Rate B, and sell the proceeds from this investment forward 
at the forward rate to turn it back into Currency A. If the forward rate does not 
correctly reflect the difference between interest rates, such an arbitrage could 
generate a profit to the extent that the return from investing Currency B and 
converting it back to Currency A with a forward contract is greater than the cost of 
borrowing Currency A for the period. 


The no-arbitrage condition requires that “you cannot earn more than your domestic 
riskless rate of interest by borrowing your domestic currency, converting it to a 
foreign currency to invest at the foreign riskless rate, and exchanging back to your 
domestic currency.’ So: 


1 
(+5) az, pot, (1 ~ Tfoceign) forward, j, 


The three terms on the right-hand side are equivalent to units of foreign currency 
for one unit of the domestic currency, growth if invested at the foreign interest rate, 
and then exchanged back to domestic currency at the forward rate. 


Equivalent relationships are as follows: 


forwards), (1+ Ty omestic) 
Spot, (1 > See) 
i-+-s, : 
forward, It (I + Taomestic) ot, hi 
; (1 Tiiateal / 
(i375) 
spot 4/¢ Fe forwards, P 


(i + T somestic) 


EXAMPLE: Calculating the arbitrage-free forward exchange rate 


Consider two currencies, the ABE and the DUB. The spot ABE/DUB exchange rate 
is 4.5671, the 1-year riskless ABE rate is 5%, and the 1-year riskless DUB rate is 
3%. What is the 1-year no-arbitrage forward exchange rate? 


Answer: 
feo i 
forward, /,, = spotyyo( z=) 
forward. j= 4.5671 (1-93) = 4.6558 ,/, 


Note that the forward rate is greater than the spot rate by 4.6558 / 4.5671 - 1= 
1.94%. This is approximately equal to the interest rate differential of 5% —- 3% = 
2%. The currency with the higher interest rate must depreciate over time by 
approximately the amount of the interest rate differential, to prevent arbitrage. 


If we are calculating a 30-, 90-, or 180-day forward exchange rate, we need to use 
interest rates for 30-, 90-, and 180-day periods rather than annual rates. Note that 
these shorter-term rates are quoted as money market yields. 


EXAMPLE: Calculating the arbitrage-free forward exchange rate with 90- 
day interest rates 
The spot ABE/DUB exchange rate is 4.5671, the 90-day riskless ABE rate is 5%, 


and the 90-day riskless DUB rate is 3%. What is the 90-day forward exchange rate 
that will prevent arbitrage profits? 


Answer: 
90 
1 + 0.05 | —- 
no-arbitrage forward = 4.5671 TUN Can) = 4.5671 a 
1+ 0.03 (er, 1.0075 
= 4.5898 ABE/DUB 


In our previous example, we calculated the no-arbitrage one-year forward ABE/DUB 
exchange rate as 4.6558. Next, we illustrate the arbitrage profit that could be gained 
if the forward exchange rate differs from this no-arbitrage rate. Consider a forward 
rate of 4.6000 so that the depreciation in the ABE is less than that implied by the no- 
arbitrage relationship. This makes the ABE attractive to a DUB investor who can 
earn a riskless profit as follows: 


» Borrow 1,000 DUB for one year at 3% to purchase ABE and get 4,567.1 ABE. 


= Invest the 4,567.1 ABE at the ABE rate of 5% to have 1.05(4,567.1) = 4,795.45 ABE 
at the end of one year. 


» Enter into a currency forward contract to exchange 4,795.45 ABE in one year at 
the forward rate of 4.6000 ABE/DUB to receive 4,795.45 / 4.6000 = 1,042.49 DUB. 


The investor has ended the year with a 4.249% return on his 1,000 DUB investment, 
which is higher than the 3% 1-year DUB interest rate. After repaying the 1,000 DUB 
loan plus interest (1,030 DUB), the investor has a profit of 1,042.49 — 1,030 = 12.49 
DUB with no risk and no initial out-of-pocket investment (i.e., a pure arbitrage 
profit). 


Arbitrageurs will pursue this opportunity, buying ABE (driving down the spot 
ABE/DUB exchange rate) and selling ABE forward (driving up the forward ABE/DUB 
exchange rate), until the interest rate parity relation is restored and arbitrage profits 
are no longer available. 


Note that the no-arbitrage forward exchange rate is approximately proportional to 
the annual interest rate differential and the time period of the forward contract. 


Calculating a Forward Exchange Rate Froma 
Forward Quote in Points or in Percentage Terms 


A forward exchange rate quote typically differs from the spot quotation and is 
expressed in terms of the difference between the spot exchange rate and the forward 
exchange rate. One way to indicate this is with points. The unit of points is the last 
decimal place in the spot rate quote. For a spot currency quote to four decimal 


places, such as 2.3481, each point is 0.0001, or 1/10,000th. A quote of +18.3 points 
for a 90-day forward exchange rate means that the forward rate is 0.00183 greater 
than the spot exchange rate. 


EXAMPLE: Forward exchange rates in points 


The AUD/EUR spot exchange rate is 0.7313 with the 1-year forward rate quoted 
at +3.5 points. What is the 1-year forward AUD/EUR exchange rate? 


Answer: 
The forward exchange rate is 0.7313 + 0.00035 = 0.73165. 


EXAMPLE: Forward exchange rates in percentage 


The AUD/EUR spot rate is quoted at 0.7313, and the 120-day forward exchange 
rate is given as -0.062%. What is the 120-day forward AUD/EUR exchange rate? 


Answer: 
The forward exchange rate is 0.7313 (1 - 0.00062) = 0.7308. 


While forward rates are typically quoted as forward points, one application where 
percentage forward quotes are useful is if we are interpreting a forward rate as the 
expected future spot rate, s,,,. We can write our no-arbitrage relation as follows: 


f-— f 
_ o f ad 
a 1 YoN Sti] 1+, 


Analysis of capital markets suggests that an increase in a country’s interest rate will 
attract foreign investment, which will lead to appreciation of the domestic currency 
(spot rate decreases). This is opposite to our conclusion from our no-arbitrage 
forward rate result that higher domestic rates are associated with a depreciation of 
the domestic currency. Historically, interest rate differences are poor predictors of 
future spot rates, although they may be unbiased and tend to get the direction of 
change correct. 


It’s best to think of the no-arbitrage forward rate as simply the forward rate at a 
point in time that prevents currency arbitrage, rather than the expected future spot 
rate. Our no-arbitrage relation holds at a point in time and does not address the 
question of how changes in interest rates affect spot exchange rates over time. 


Interpreting a Forward Discount or Premium 


The forward discount or forward premium for a currency is calculated relative 
to the spot exchange rate. The forward discount or premium for the base currency is 
the percentage difference between the forward price and the spot price. 


Consider the following spot and forward exchange rates: 
USD/EUR spot = $1.312 
USD/EUR 90-day forward = $1.320 


The (90-day) forward premium or discount on the euro = forward / spot — 1 = 1.320 
/ 1.312 - 1 = 0.610%. Because this is positive, it is interpreted as a forward premium 


on the euro of 0.610%. Because we have the forward rate for three months, we could 
annualize the discount simply by multiplying by 4 (= 12 / 3). 


Because the forward quote is greater than the spot quote, it will take more dollars to 
buy one euro 90 days from now, so the euro is expected to appreciate versus the 
dollar, and the dollar is expected to depreciate relative to the euro. 


If the forward quote were less than the spot quote, the calculated amount would be 
negative, and we would interpret that as a forward discount for the euro relative to 
the USS. dollar. 


=) MODULE QUIZ 19.1 


. 


1. 


Today’s spot rate for the Indonesian rupiah (IDR) is IDR/USD 2,400.00, and the 
New Zealand dollar trades at NZD/USD 1.6000. The NZD/IDR cross rate is: 

A. 0.00067. 

B. 1,492.53. 

C. 3,840.00. 


. The New Zealand dollar (NZD) is trading at USD/NZD 0.3500, and the Swedish 


krona (SEK) is trading at NZD/SEK 0.3100. The USD/SEK cross rate is: 
A. 0.1085. 
B. 8.8573. 
C. 9.2166. 


. The spot Swiss franc/British pound (CHF/GBP) exchange rate is 1.3050. In the 


180-day forward market, the CHF/GBP exchange rate is —42.5 points. The 180-day 
forward CHF/GBP exchange rate is closest to: 

A. 1.2625. 

B. 1.3008. 

C. 1.3093. 


. The spot rate on the New Zealand dollar (NZD) is NZD/USD 1.4286, and the 180- 


day forward rate is NZD/USD 1.3889. This difference means: 

A. interest rates are lower in the United States than in New Zealand. 

B. interest rates are higher in the United States than in New Zealand. 

C. it takes more NZD to buy one USD in the forward market than in the spot 
market. 


. The current spot rate for the British pound (GBP) in terms of U.S. dollars is 


$1.533 and the 180-day forward rate is $1.508. Relative to the pound, the dollar is 
trading closest to a 180-day forward: 

A. discount of 1.63%. 

B. premium of 1.66%. 

C. discount of 1.66%. 


. The annual interest rates in the United States (USD) and Sweden (SEK) are 4% 


and 7% per year, respectively. If the current spot rate is SEK/USD 9.5238, then 
the 1-year forward rate in SEK/USD is: 

A. 9.2568. 

B. 9.7985. 

C. 10.2884. 


. The annual risk-free interest rate is 10% in the United States (USD) and 4% in 


Switzerland (CHF), and the 1-year forward rate is USD/CHF 0.80. Today’s 
USD/CHF spot rate is closest to: 

A. 0.7564. 

B. 0.8462. 

C. 0.8888. 


KEY CONCEPTS 


LOS 19.a 


Given two exchange rate quotes for three different currencies, we can calculate a 
currency cross rate. If the MXN/USD quote is 12.1 and the USD/EUR quote is 1.42, 
we can calculate the cross rate of MXN/EUR as 12.1 x 1.42 = 17.18. 


LOS 19.b 


The condition that must be met so that there is no arbitrage opportunity available is 
as follows: 


1+1,. : 1+i.. 
eee = ; — currency) so that forward = spot x (U * *psice curency) 


+ Ros currency | (1 $ Siccsmsaanaa’ 


If the spot exchange rate for the euro is 1.25 USD/EUR, the euro interest rate is 4% 
per year, and the dollar interest rate is 3% per year, the no-arbitrage one-year 
forward rate can be calculated as follows: 


1.25 x (1.03 / 1.04) = 1.238 USD/EUR 


Points in a foreign currency quotation are in units of the last digit of the quotation. 
For example, a forward quote of +25.3 when the USD/EUR spot exchange rate is 
1.4158 means that the forward exchange rate is 1.4158 + 0.00253 = 1.41833 
USD/EUR. 


For a forward exchange rate quote given as a percentage, the percentage (change in 
the spot rate) is calculated as forward / spot - 1. A forward exchange rate quote of 
+1.787%, when the spot USD/EUR exchange rate is 1.4158, means that the forward 
exchange rate is 1.4158 (1 + 0.01787) = 1.4411 USD/EUR. 


If a forward exchange rate does not correctly reflect the difference between the 
interest rates for two currencies, an arbitrage opportunity for a riskless profit exists. 
In this case, borrowing one currency, converting it to the other currency at the spot 
rate, investing the proceeds for the period, and converting the end-of-period amount 
back to the borrowed currency at the forward rate will produce more than enough 
to pay off the initial loan, with the remainder being a riskless profit on the arbitrage 
transaction. 


To calculate a forward premium or discount for the base currency, use the following 
formula: 


(forward / spot) — 1 


Forward exchange rates are poor predictors of future exchange rates. Forward 
exchange rates should be interpreted as simply the no-arbitrage forward rates at a 
point in time, not as predictors of future spot rates. 


ANSWER KEY FOR MODULE QUIZZES 


Module Quiz 19.1 


1.A Start with one NZD and exchange for 1 / 1.6 = 0.625 USD. Exchange the USD for 
0.625 x 2,400 = 1,500 IDR. We get a cross rate of 1,500 IDR/NZD, or 1 / 1,500 = 


2.A 


3.B 


4.B 


5.B 


6.B 


7.A 


0.00067 NZD/IDR. (LOS 19.a) 


USD/NZD 0.3500 x NZD/SEK 0.3100 = USD/SEK 0.1085. Notice that the NZD 
term cancels in the multiplication. (LOS 19.a) 


The 180-day forward exchange rate is 1.3050 — 0.00425 = CHF/GBP 1.30075. 
(LOS 19.b) 


Interest rates are higher in the United States than in New Zealand. It takes 
fewer NZD to buy one USD in the forward market than in the spot market. (LOS 
19.b) 


To calculate a percentage forward premium or discount for the U.S. dollar, we 
use the dollar to be the base currency. The spot and forward quotes given are 
U.S. dollars per British pound (USD/GBP), so we must invert them to GBP/USD. 
The spot GBP/USD price is 1 / 1.533 = 0.6523 and the forward GBP /USD price 
is 1 / 1.508 = 0.6631. Because the forward price is greater than the spot price, 
we Say the dollar is at a forward premium of 0.6631 / 0.6523 - 1 = 1.66%. 
Alternatively, we can calculate this premium with the given quotes as spot / 
forward — 1 to get 1.533 / 1.508 - 1 = 1.66%. (LOS 19.b) 


The forward rate in SEK/USD is 9.5238 (4-07) 9.7985. Because the SEK interest 
rate is the higher of the two, the SEK must depreciate approximately 3%. (LOS 
19.b) 

We can solve for the spot rate as follows. With the exchange rates quoted as 
USD/CHE , the spot is 0.80(+-73) 0.7564. Because the interest rate is higher in 


the United States, it should take fewer USD to buy CHF in the spot market. In 
other words, the USD must be at a forward discount relative to the CHF. (LOS 
19.b) 


Topic Quiz: Economics 


You have now finished the Economics topic section. Please log into your Schweser 
online dashboard and take the Topic Quiz on this section. The Topic Quiz provides 
immediate feedback on how effective your study has been for this material. Questions 
are more exam-like than typical Module Quiz or QBank questions; a score of less than 
70% indicates that your study likely needs improvement. These tests are best taken 
timed; allow 1.5 minutes per question. 


FORMULAS 


Quantitative Methods 
FV =PV(1+2) 


PV= ee = FV(1+1)+t 
where: 
r = interest rate per compounding period 
t = number of compounding periods 
continuous compounding: 


FV = PV x c* 
PV = FV x e* 
PV of a perpetuity = Pavment 
annuity payment =; 7 =Y 
where: 


r = interest rate per period 

t = number of periods 

PV = present value (principal) 

D 

constant growth dividend discount model: Vp) = E 3 
e c 


D 
required rate of return: k, = v; +k. 


D 
implied growth rate: g. = k, — we 


Vo 
2x 


sample mean: X = =} 


range = maximum value — minimum value 


21%: — | 
mean absolute deviation = 1 ____ 
7 _? 
2X — x) 
1 -s2— 1 
sample variance: s aa 
a ary 
Ad? 3) 
sample standard deviation: s = +. a | 


Sx _ standard deviation of x 


coefficient of variation = = 
¢ average value of x 


target downside deviation: 
a 


(X;-B)? 
a all X.<B 


Starget n—-1 
where B = target value 


sample covariance: 


$([x.-¥][%.-¥]} 


Sxy= n—1 

where: 

X; = an observation of variable X 
Y; = an observation of variable Y 
X = mean of variable X 

Y = mean of variable Y 

n = number of periods 


correlation coefficient: p,.-= ixSy 


P(O|T) 
] . —_ 
: : . _ a COVyy 
slope coefficient for a simple linear regression: b,; = > 
o 
x 


intercept term for a simple linear regression: 


a ——- 
total sum of squares (SST): SST = 2%; - yy 
i= 
> = 2 
sum of squares regression (SSR): SSR = dy, — Y) 
i=1 


—~ 2 
sum of squared errors (SSE): SSE = )_(Y; — Y) 
= 


coefficient of determination: R? = SSR / SST 


F-statistic: 
F= MSR = SSR/k 
MSE  SSE/n-—k-1 
where: 


MSR = mean regression sum of squares 
MSE = mean squared error 


Economics 


CPI 
real P/B exchange rate = nominal P/B exchange rate x (ep 


arbitrage-free forward exchange rate relationship: —————_ = 


APPENDICES 


APPENDIX A: AREAS UNDER THE NORMAL 
CURVE 
Most of the examples in this book have used one version of the z-table to find the 


area under the normal curve. This table provides the cumulative probabilities (or 
the area under the entire curve to left of the z-value). 


Probability Example 


Assume that the annual earnings per share (EPS) for a large sample of firms is 
normally distributed with a mean of $5.00 and a standard deviation of $1.50. What is 


the approximate probability of an observed EPS value falling between $3.00 and 
$7.25? 


If EPS = x = $7.25, then z = (x — p)/o = ($7.25 — $5.00)/$1.50 = +1.50 
If EPS = x = $3.00, then z = (x — u)/o = ($3.00 — $5.00)/$1.50 = —1.33 


Solving Using the Cumulative Z-Table 


For z-value of 1.50: Use the row headed 1.5 and the column headed 0 to find the 
value 0.9332. This represents the area under the curve to the left of the critical value 
1.50. 


For z-value of -1.33: Use the row headed 1.3 and the column headed 3 to find the 
value 0.9082. This represents the area under the curve to the left of the critical value 
+1.33. The area to the left of -1.33 is 1 - 0.9082 = 0.0918. 


The area between these critical values is 0.9332 - 0.0918 = 0.8414, or 84.14%. 


Hypothesis Testing—One-Tailed Test Example 


A sample of a stock’s returns on 36 nonconsecutive days results in a mean return of 
2.0%. Assume the population standard deviation is 20.0%. Can we say with 95% 
confidence that the mean return is greater than 0%? 
Hy: up < 0.0%, H,: up > 0.0%. The test statistic = z-statistic 
Ho 
o/ND 


(2.0 — 0.0) / (20.0 / 6) = 0.60 


The significance level = 1.0 — 0.95 = 0.05, or 5%. Because we are interested in a 
return greater than 0.0%, this is a one-tailed test. 


Using the Cumulative Z-Table 


Because this is a one-tailed test with an alpha of 0.05, we need to find the value 0.95 
in the cumulative z-table. The closest value is 0.9505, with a corresponding critical 
z-value of 1.65. Because the test statistic is less than the critical value, we fail to 
reject Ho. 


Hypothesis Testing—Two-Tailed Test Example 


Using the same assumptions as before, suppose that the analyst now wants to 
determine if he can say with 99% confidence that the stock’s return is not equal to 
0.0%. 


Ho: p = 0.0%, H,: p + 0.0%. The test statistic (z-value) = (2.0 - 0.0) / (20.0 / 6) = 0.60. 


The significance level = 1.0 - 0.99 = 0.01, or 1%. Because we are interested in 
whether or not the stock return is nonzero, this is a two-tailed test. 


Using the Cumulative Z-Table 


Because this is a two-tailed test with an alpha of 0.01, there is a 0.005 rejection 
region in both tails. Thus, we need to find the value 0.995 (1.0 — 0.005) in the table. 
The closest value is 0.9951, which corresponds to a critical z-value of 2.58. Because 
the test statistic is less than the critical value, we fail to reject Ho and conclude that 


the stock’s return equals 0.0%. 


CUMULATIVE Z-TABLE 


Standard Normal Distribution 


P(Z)sz 


N(z) forz >0 


P(Z < z) 
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Standard Normal Distribution 


P(Z =z) 


P(Z < z)= N(z) forz>0 z 


| z | 0.00 | 0.01 | 0.02 | 0.03 | 0.04 | 0.05 | 0.06 | 0.07 | 0.08 | 0.09 | 
| 0.0 | 0.5000 | 0.4960 | 0.4920 | 0.4880 | 0.4840 | 0.4801 | 0.4761 | 0.4721 | 0.4681 | 0.4641 | 
|-0.1| 0.4602 | 0.4562 | 0.4522 | 0.4483 | 0.4443 | 0.4404 | 0.4364 | 0.4325 | 0.4286 | 0.4247 | 
|-0.2| 0.4207 | 0.4168 | 0.4129 | 0.4090 | 0.4052 | 0.4013 | 0.3974 | 0.3936 | 0.3897 | 0.3859 | 
|-0.3| 0.3821 | 0.3783 | 0.3745 | 0.3707 | 0.3669 | 0.3632 | 0.3594 | 0.3557 | 0.3520 | 0.3483 | 
|-0.4| 0.3446 | 0.3409 | 0.3372 | 0.3336 | 0.3300 | 0.3264 | 0.3228 | 0.3192 | 0.3156 | 0.3121 | 
a [_ ie aa 


| | Le LC | 
|-0.5| 0.3085 | 0.3050 | 0.3015 | 0.2981 | 0.2946 | 0.2912 | 0.2877 | 0.2843 | 0.2810 | 0.2776 | 
ee eee pers ene eed Cee 
9.71 0.2420 | 0.2389 10.2358 | 0.2327 10.2297 | 0.2266 | 0.2236 [0.2207 | 0.2177 0.2148. 
as 
filige 0 
0.0869 | 0. 
{ __t- 


0.0392 | 0. 
| 0.0314 | 0. 
) | 0.0250 | 0. 
ce ee a ss ae 
|=2.0| 0.0228 | 0.0222 | 0.0217 | 0 
35100179 0.0174 0.0170 | 0.0166 [0.0162 [0.0158 [0.0154 [0.0150 [0.0146 [0.0143 
|=2.2| 0.0139 | 0.0136 | 0.0132 | 0.0129 | 0.0125 | 0.0122 | 0.0119 | 0.0116 | 0.0113 | 0.0110 | 
|-2.3] 0.0107 | 0.0104 | 0.0102 | 0.0099 | 0.0096 | 0.0094 | 0.0091 | 0.0089 | 0.0087 | 0.0084 | 
Ep 
a (a ee ee <a 

|-2.5| 0.0062 | 0.0060 | 0.0059 | 0.0057 | 
370.0035 [0.0034 [0.0033 [0.0032 [0.0031 [0.0030 [0.0029 [ 0.0028 [ 0.0027 [ 0.0026" 
|=2.8| 0.0026 | 0.0025 | 0.0024 | 0.0023 | 0.0023 | 0.0022 | 0.0021 | 0.0021 | 0.0020 | 0.0019 | 
|=2.9[ 0.0019 | 0.0018 | 0.0018 | 0.0017 | 0.0016 | 0.0016 | 0.0015 | 0.0015 | 0.0014 | 0.0014 | 
|-3.0/ 0.0013 | 0.0013 | 0.0013 | 0.0012 | 0.0012 | 0.0011 | 0.0011 | 0.0011 | 0.0010 | 0.0010 | 


APPENDIX B: STUDENT'S T-DISTRIBUTION 
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| df {| 0.100 | 0.050 | 0.025 | 0.01 | 0.005 | 0.0005 


Level of Significance for Two-Tailed Test 


| df | 0.20 | 0.10 | 0.001 
| 1 | 3.078 | 6.314 | 12.706 31.821 63.057 
| 2 | «1886 | 2920 | 4303 | 6965 | 9.925 31.599 
| 3) 1.638 | 2.353 | 4541 | 5841 
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APPENDIX C: F-TABLE AT 5% (UPPER TAIL) 
F-Table, Critical Values, 5% in Upper Tail 


Degrees of freedom for the numerator along top row 


Degrees of freedom for the denominator along side row 


| {[il{2{/3 [a ]s5]o6[71| 8] 9 | 10] 12] 15 | 20 | 24 | 30| 40 
| 1 | 161 | 200 | 216 | 225 | 230 | 234 | 237 | 239 | 241 | 242 | 244 | 246 | 248 | 249 | 250 | 251 | 
sat Eaaticiaea eenraote CaEineaaa eel Pleciedeall [ade eiaala Lane 
| 3 | 10.1| 9.55 |9.28|9.12|9.01 | 8.94 | 8.89 | 8.85 | 8.81 | 8.79 | 8.74 | 8.70 | 8.66 | 8.64 | 8.62 | 8.59 | 
| 4 {7.71 | 6.94| 6.59 | 6.39 | 6.26 | 6.16 | 6.09 | 6.04 | 6.00 | 5.96 | 5.91 | 5.86 | 5.80 | 5.77 | 5.75 | 5.72 | 
“Joni |S 79] sat [s19]s05]495|ass]aa) [477 [4 74] 408 [402 [aso] a 53/450 ado) 

ee Ee ee ee ee ee ee ee ee ee 

L6 |5.99|5.14| 4.76 |4.53 | 4.39 | 4.28 | 4.21 | 4.15 | 4.10 | 4.06 | 4.00 | 3.94 | 3.87 | 3.84 | 3.81 | 3.77, 
| 7 [5.59 | 4.74] 4.35 |4.12 |3.97 |3.87 | 3.79 | 3.73 | 3.68 | 3.64 |3.57 | 3.51 [3.44 | 3.41 | 3.38 | 3.34 | 
| 8 | 5.32 | 4.46 | 4.07 | 3.84 | 3.69 | 3.58 | 3.50 | 3.44 | 3.39 | 3.35 | 3.28 | 3.22 | 3.15 | 3.12 | 3.08 | 3.04. 
| 9 | 5.12 | 4.26 | 3.86 | 3.63 | 3.48 | 3.37 | 3.29 | 3.23 | 3.18 | 3.14 | 3.07 | 6.01 | 2.94 | 2.90 | 2.86 | 2.83 | 
| 10 | 4.96 | 4.10 | 3.71 | 3.48 | 3.33 | 3.22 | 3.14 | 3.07 | 3.02 | 2.98 | 2.91 | 2.85 | 2.77 | 2.74 | 2.70 | 2.66 | 
ee et i) 


a ae a 
| 11 | 4.84 | 3.98 | 3.59 | 3.36 [3.20 | 3.09 | 3.01 | 2.95 | 2.90 | 2.85 | 2.79 |2.72 | 2.65 | 2.61 | 2.57 | 2.53 | 
| 12 | 4.75 | 3.89 | 3.49 | 3.26 | 3.11 | 3.00 | 2.91 | 2.85 | 2.80 | 2.75 | 2.69 | 2.62 |2.54 | 2.51 | 2.47 | 2.43 | 
| 13 | 4.67 | 3.81 | 3.41 | 3.18 | 3.03 |2.92 | 2.83 | 2.77 | 2.71 | 2.67 | 2.60 | 2.53 |2.46 | 2.42 | 2.38 | 2.34 | 
| 14 | 4.60 | 3.74 | 3.34 | 3.11 | 2.96 |2.85 | 2.76 | 2.70 | 2.65 | 2.60 | 2.53 |2.46 | 2.39 | 2.35 | 2.31 | 2.27 | 

at $38 [32930 [290[2 79/271 [at 59[254[248 [240 [2 33]229|225 [220 
ee ee eee 
[16 | 4.49 | 3.63 | 3.24 | 3.01 |2.85 |2.74 | 2.66 | 2.59 | 2.54 | 2.49 |2.42 |2.35 |2.28 | 2.24 |2.19 [2.15 | 
17 | 4.45 | 3.59 | 3.20 | 2.96 |2.81 |2.70 | 2.61 | 2.55 |2.49 | 2.45 | 2.38 | 2.31 |2.23 | 2.19 |2.15 |2.10 | 

4.41 | 3.55 | 3.16 |2.93 |2.7 | 2.66 | 2.58 | 2.51 |2.46 |2.41 |2.34 |2.27 [2.19 | 2.15 | 2.11 |2.06 | 
[19 | 4.38 | 3.52 | 3.13 [2.90 |2.74 |2.63 | 2.54 | 2.48 | 2.42 | 2.38 [2.31 | 2.23 |2.16 | 2.11 | 2.07 | 2.03 | 
| 20 | 4.35 | 3.49 | 3.10 [2.87 |2.71 |2.60 | 2.51 |2.45 |2.39 | 2.35 |2.28 | 2.20 |2.12 | 2.08 | 2.04 | 1.99 | 
es ee ee ee ee 
| 21 | 4.32 | 3.47 |3.07 | 2.84 | 2.68 |2.57 | 2.49 | 2.42 | 2.37 | 2.32 |2.25 |2.18 |2.10 | 2.05 |2.01 |1.96 | 
| 22 | 4.30 |3.44 [3.05 |2.82 |2.66 |2.55 | 2.46 |2.40 | 2.34 |2.30 |2.23 |2.15 |2.07 | 2.03 | 1.98 | 1.94 | 
| 23 | 4.28 | 3.42 |3.03 | 2.80 |2.64 |2.53 | 2.44 |2.37 |2.32 | 2.27 | 2.20 | 2.13 | 2.05 | 2.01 | 1.96 | 1.91 | 
| 24 | 4.26 | 3.40 | 3.01 | 2.78 | 2.62 | 2.51 | 2.42 | 2.36 | 2.30 | 2.25 | 2.18 | 2.11 | 2.03 | 1.98 | 1.94 | 1.89 | 
| 25 | 4.24 | 3.39 | 2.99 | 2.76 | 2.60 | 2.49 | 2.40 | 2.34 | 2.28 | 2.24 | 2.16 | 2.09 | 2.01 | 1.96 | 1.92 | 1.87, 

Pe EE se ee a ee ee 
| 30 | 4.17 | 3.32 | 2.92 | 2.69 | 2.53 | 2.42 | 2.33 | 2.27 | 2.21 | 2.16 | 2.09 | 2.01 | 1.93 | 1.89 | 1.84] 1.79 | 
| 40 | 4.08 | 3.23 | 2.84 | 2.61 | 2.45 | 2.34 | 2.25 | 2.18 | 2.12 | 2.08 | 2.00 | 1.92 | 1.84] 1.79 | 1.74 | 1.69 | 
pi od Ba 26128 RAT 2S RATA 08 29182 1.84 17S LDL 138, 
|120| 3.92 | 3.07 | 2.68 | 2.45 | 2.29 | 2.18 | 2.09 | 2.02 | 1.96 | 1.91 | 1.83 | 1.75 | 1.66 | 1.61 | 1.55. 


re: [38e[ 30 [20 [237] 224 [210201] 194[ 1s [14s [75 |tor[ts7|1s2] tas] 


APPENDIX D: F-TABLE AT 2.5% (UPPER TAIL) 
F-Table, Critical Values, 2.5% in Upper Tails 


Degrees of freedom for the numerator along top row 


Degrees of freedom for the denominator along side row 


| [i{t2{3]{4]{5]{[6]{7]s8s {9 | 10] 12 | 15 | 20 | 2 | 30 | 40 | 
| 1 | 048 | 799 | 864 | 900 | 922 | 937 | 948 | 957 | 963 | 969 | 977 | 985 | 993 | 997 | 1001 | 1006 | 
pe ee 
| 3 |17.44| 16.04| 15.44] 15.10] 14.88 | 14.73 | 14.62| 14.54| 14.47] 14.42| 14.34] 14.25] 14.17| 14.12] 14.08 | 14.04 
ra [i272] 10-65] 998 | 960 [9.36 [9201907 [898 [890384] 875| 806] 855] 851 | 840 [8a | 

| 5 |10.01| 8.43 | 7.76 | 7.39 | 7.15 | 6.98 | 6.85 | 6.76 | 6.68 | 6.62 | 6.52 | 6.43 | 6.33 | 6.28 | 6.23 | 6.18 | 
mp pp pp 
Se eee ee 3 | 5.99 | 5.82 | 5.70 | 5.60 | 5.52 | 5.46 | 5.37 | 5.27 | 5.17 | 5.12 | 5.07 | 5.01 | 
2 | 5.29 | 5.12 | 4.99 | 4.90 | 4.82 | 4.76 | 4.67 | 4.57 | 4.47 | 4.41 | 4.36 | 4.31 | 
of aer] aes [ass [aes [450] 450] 420410 400] 3951 308 [3 
2 | 4.48 | 4.32 | 4.20 | 4.10 | 4.03 | 3.96 | 3.87 | 3.77 | 3.67 | 3.61 | 3.56 | 3.51 | 
SE as Gees Ge Ge 

a ee ee ee ee ee 


ee ae ee ee 

| 3.88 | 3.76 | 3.66 | 3.59 | 3.53 | 3.43 | 3.33 | 3.23 | 3.17 | 3.12 | 3.06 | 

5.10 2 [3.89 [3.73 | 3.61 | 3.51 | 3.44 | 3.37| 3.28 | 3.18 | 3.07 | 3.02 | 2.96 | 2.91, 

ra feat [497 [ass] amo arr aol sass oo ani [aasTais [aos [29s [oso [ose [a 
| 14 | 6.30 | 486 | 4.24 | 3.89 | 3.66 | 3.50 | | 3.21 | 3.15 | 3.05 | 2.95 | 2.84 | 2.79 | 2.73 | 2.67 | 
soap air sis se {3se {a1 { 29320312] e[29e[ zee[ 2 0o| 270 | 24239 


Eel eae 
PAAR CAETARYARCAEY AEP ARTARUAPCAPTAPTART APT OPTAPETE 
| 17 | 6.04 | 4.62 | 4.01 | 3.66 | 3.44 | 3.28 | 3.16 | 3.06 | 2.98 | 2.92 | 2.82 | 2.72 | 2.62 | 2.56 | 2.50 | 2.44 | 
a 
| 19 | 5.92 | 4.51 | 3.90 | 3.56 | 3.33 | 3.17 | 3.05 | 2.96 | 2.88 | 2.82 | 2.72 | 2.62 | 2.51 | 2.45 | 
| 20 | 5.87 | 4.46 | 3.86 | 3.51 | 3.29 | 3.13 | 3.01 | 2.91 | 2.84 | 2.77 | 2.68 | 2.57 | 2.46 | 2.41 | 
a a 
| 21 | 5.83 | 4.42 | 3.82 | 3.48 | 3.25 | 3.09 | 2.97 | 2.87 | 2.80 | 2.73 | 2.64 | 2.53 | 2.42 | 2.37 | 
2215.79 | 4381 3.78 | 3.44 | 3.22 | 3.05 | 2.98 | 2.84 | 2.76 1 2.701 2.60 | 2.50} 2.99 | 2.53 
| 23 | 5.75 | 4.35 | 3.75 | 3.41 | 3.18 | 3.02 | 2.90 | 2.81 | 2.73 | 2.67 | 2.57 | 2.47 | 2.36 | 2.30 | 
| 24 | 5.72 | 4.32 | 3.72 | 3.38 | 3.15 | 2.99 | 2.87 | 2.78 | 2.70 | 2.64 | 2.54 | 2.44 | 2.33 | 2.27 | 
25 | 5.69 | 4.29 | 3.69 | 3.35 | 3.13 | 2.97 | 2.85 | 2.75 | 2.68 | 2.61 | 2.51 | 2.41 | 2.30 | 2.24 | 

AEs aa) MR ARENT ESRD) RE) (A (AM SG HR 


APPENDIX E: CHI-SQUARED TABLE 


Values of x? (Degrees of Freedom, Level of Significance) 


Probability in Right Tail 


TEEiow | 099 | 0975 | oss | 09 | 01 | 00s | 0.5 | oot | 0.005 | 
Freedom 


[| 1 _{ 0.000157 | 0.000982 | 0.003932 | 0.0158 | 2.706 | 3.841 | 5.024 | 6.635 | 7.879 | 
| 2 ‘| 0.020100 | 0.050636 | 0.102586 | 0.2107 | 4.605 | 5.991 | 7.378 | 9.210 | 10.597 | 
| 3 | 0.1148 | 0.2158 | 0.3518 | 0.5844 | 6.251 | 7.815 | 9.348 | 11.345 | 12.838 | 
a ST ETE Ee ED 
=f 


ar sar hes er ho 

pte } Lo tater fos orf leo} ott fas. 5 P2028 
A RCRA BER TMT 
ST tes 1-9 TOOTS oss fais ose [16.919 T9003 [21-656 13 585+ 
90 }-3.558-[ 3347 3.940 | 4805115987 [18-307 [20.485 [23.209 [25 188 
et 

| 3.053 | 3.816 | 4575 | 5.578 | 17.275 | 19.675 | 21.920 | 24.725 | 26.757 | 
Se STO RECREATE COO 
| 13] 4.107 | 5.009 | 5.892 | 7.041 | 19.812 | 22.362 | 24.736 | 27.688 | 29.819 | 
| 14 | 4.600 | 5.629 | 6.571 | 7.790 | 21.064 | 23.685 | 26.119 | 29.141 | 31.319 | 
| 15 | 5.229 | 6.262 | 7.261 | 8.547 | 22.307 | 24.996 | 27.488 | 30.578 | 32.801 | 
po 


| 16 | 5.812 | 6908 | 7.962 | 9.312 | 23.542 | 26.296 | 28.845 | 32.000 | 34.267 | 
| 17S] 6.408 | 7.564 | 8.672 | 10.085 | 24.769 | 27.587 | 30.191 | 33.409 | 35.718 | 
| 18 | 7.015 | 8.231 | 9.390 | 10.865 | 25.989 | 28.869 | 31.526 | 34.805 | 37.156 | 
| 9 | 7.633 | 8.907 | 10.117 | 11.651 | 27.204 | 30.144 | 32.852 | 36.191 | 38.582 | 
| 20 | 8.200 | 9.591 | 10.851 | 12.443 | 28.412 | 31.410 | 34.170 | 37.566 | 39.997 | 
eed eel ee eS ee LS 


Ee ee ee 
| 2a | 8.897 | 10.283 | 11.591 | 13.240 | 29.615 | 32.671 | 35.479 | 38.932 | 41.401 | 
a ef a ee se 
| 23 | 10.196 | 11.689 | 13.091 | 14.848 | 32.007 | 35.172 | 38.076 | 41.638 | 44.181 | 
nse ant | iss 115059 9 eas [394 [om | 
A 


3G Ta Tae es as 
[| 27 | 12.878 | 14.573 | 16.151 | 18.114 | 36.741 | 40.113 | 43.195 | 46.963 | 49.045 | 
| 28 | 13.565 | 15.308 | 16.928 | 18.939 | 37.916 | 41.337 | 44.461 | 48.278 | 50.994, 
[| 29 |: 14.256 | 16.047 | 17.708 | 19.768 | 39.087 | 42.557 | 45.722 | 49.588 | 52.335 | 
| 30 | 14.953 | 16.791 | 18.493 | 20.599 | 40.256 | 43.773 | 46.979 | 50.892 | 53.672 | 
ey GE ieeaiees) Pee ees Cee ees ee es 
| 50s 29.707 | 32.357 | 34.764 | 37.689 | 63.167 | 67.505 | 71.420 | 76.154 | 79.490 | 
| 60 | 37.485 | 40.482 | 43.188 | 46.459 | 74.397 | 79.082 | 83.298 | 88.379 | 91.952 | 

te ee ee 
| 100 | 70.065 | 74222 | 77.929 | 82.358 | 118.498] 124.342| 129.561 | 135.807] 140.170] 


INDEX 


A 


action lag, 177 

additivity principle, 24 
after-tax nominal return, 11 
algorithmic trading, 132 
alternative data, 129 
alternative hypothesis, 84 
amortizing bond, 20 
analysis of variance (ANOVA), 117 
annualized return, 9 
annuity, 21 

arithmetic mean, 4, 33 
arithmetic mean return, 3 
artificial intelligence, 130 
autarky, 197 

automatic stabilizers, 172 


B 


base currency, 216 

Bayes’ formula, 54 

biased estimator, 37 

Big Data, 129 

bilateralism, 197 

bimodal, 34 

binomial model, 28 

black swan risk, 199 

bond market vigilantes, 189 
bootstrap resampling, 71, 79 
box and whisker plot, 35 
breakeven point, 138 
budget deficit, 171 

budget surplus, 171 
business cycle, 161 

buy side, 216 


C 


capital restrictions, 209 


capital spending, 174 

cartel, 149 

cash flow additivity principle, 24 
central limit theorem, 77 
classical cycle, 161 

cluster sampling, 76 

coefficient of determination, 119 
coefficient of variation (CV), 38 
coincident indicators, 166 
common stock, 22 

compounding frequency, 10 
concentration measures, 153 
conditional expected value, 53 
constant growth DDM, 22 
consumer spending, 163 
contraction, 161 

contractionary monetary policy, 171 
convenience sampling, 77 
conventional fixed peg arrangement, 220 
corporate exhaust, 130 
corporations, 216 

correlation, 46, 62 

cost averaging, 4 

coupon rate, 20 

Cournot model, 148 

covariance, 45, 59 

covariance matrix, 60 

crawling peg, 221 

credit cycles, 162 

critical value, 85 

cross rate, 227 

crowding-out effect, 173, 177 
currency board arrangement, 220 
current spending, 174 

cyclically adjusted budget deficit, 178 


D 


data science, 130 
debt ratio, 172 
decile, 35 

decision rule, 85, 87 
deep learning, 131 
default risk, 2 


degrees of freedom (df), 119 
dependent variable, 109 

differences between means, 89 
direct quote, 216 

direct taxes, 174 

discount rate, 1 

discretionary fiscal policy, 172, 176 
diseconomies of scale, 141 
dispersion, 36 

disposable income, 175 

dividend discount model (DDM), 22 
dividend yield, 24 

dominant firm model, 149 
downside risk, 39 

durable goods, 164 


1 


economic tools, 200 
equality of variances, 97 
equity securities, 21 

event risk, 199 

excess kurtosis, 43 
exchange rate, 216 
exchange rate regimes, 220 
exchange rate targeting, 188 
exogenous risk, 199 
expansion, 161 
expansionary monetary policy, 171 
expected value, 51 

export subsidy, 207, 208 


F 


F-distribution, 97 

fiat money, 183 

financial tools, 200 

fintech, 129 

fiscal multiplier, 175 

fiscal policy, 171 

fiscal policy tools, 173 
fixed-coupon bond, 20 

foreign direct investment, 197 
formal dollarization, 220 


forward currency contract, 215 
forward discount, 231 

forward exchange rate, 218 
forward premium, 231 


G 


geometric mean, 4 
geometric mean return, 3 
geophysical resource endowment, 196 
geopolitical risk, 199 
geopolitics, 195 
globalization, 196 
Gordon growth model, 22 
government entities, 216 
gross return, 11 

growth cycle, 161 
growth rate cycle, 162 


H 


harmonic mean, 4, 5 

hedging, 215 

hegemony, 197 

Herfindahl-Hirschman Index (HHI), 153 
heteroskedasticity, 115 

high-frequency trading, 132 

holding period return (HPR), 2 
homoskedasticity, 115 

hypothesis, 83 

hypothesis testing steps, 83 


I 


impact lag, 177 

impact of geopolitical risk, 200 

implied forward rate, 26 

implied growth rate, 24 

independently and identically distributed. 70 
independently floating exchange rate, 221 
independent variable, 109 

indirect quote, 216 

indirect taxes, 174 

inflation premium, 2 


inflation reports, 188 

inflation targeting, 188 

institutions, 196 

interest rate targeting, 188 

internal rate of return (IRR), 5 
International Monetary Fund (IMF), 198 
Internet of Things, 130 

interquartile range, 35 

inventory-sales ratio, 163 

investment accounts, 216 


J 


jackknife resampling, 79 
judgmental sampling, 77 


K 


kinked demand curve model, 146 
kurtosis, 43 


L 


lagging indicators, 166 

latency, 130 

leading indicators, 166 

legal tender, 183 

leptokurtic, 43 

leveraged accounts, 216 

leveraged return, 12 

likelihood of geopolitical risk, 199 
lin-log model, 124 

liquidity risk, 2 

liquidity trap, 189 

location of the mean, median, and mode, 41 
log-lin model, 124 

log-log model, 124 

lognormal distribution, 69 

long run, 137 

long-run shutdown point, 138 


M 


machine learning, 130 


managed floating exchange rates, 221 
management of exchange rates within crawling bands, 221 
maturity risk, 2 

mean absolute deviation (MAD), 36 
mean differences, 92 

mean squared error (MSE), 118 

mean square regression (MSR), 117 
measures of central tendency, 33 
median, 33 

menu costs, 184 

mesokurtic, 43 

minimum domestic content requirement, 207 
minimum efficient scale, 141 

modal interval, 34 

mode, 34 

monetary policy, 171 

monetary policy tools, 184 

monetary transmission mechanism, 185 
monetary union, 220 

money-weighted return, 5, 6 
monopolistic competition, 143 
monopoly, 143 

Monte Carlo simulation, 70 
multilateralism, 197 

multistage DDM, 22 


N 


Nash equilibrium, 148 
nationalism, 196 

national security tools, 200 
natural language processing, 131 
negative skew, 41 

net present value (NPV), 5 

net return, 11 

neural networks, 130 

N-firm concentration ratio, 153 
no-arbitrage principle, 25 
nominal exchange rate, 217 
nondurable goods, 164 
nonparametric test, 99 
nonprobability sampling, 75 
nonstate actors, 195 

null hypothesis, 84 


O 


objective of a central bank, 184 
oligopoly, 143 

one-stage cluster sampling, 76 
operational independence, 187 
ordinary least squares (OLS), 111 
outlier, 33, 41, 116 

overfitting, 131 


P 


paired comparisons test, 91, 92 
parametric test, 99 

par value, 21 

peak, 161 

pegging, 184 

percentile, 35 

perfect competition, 143 
perpetual bond, 20 
perpetuity, 20 

physical capital, 163 
platykurtic, 43 

portfolio investment flows, 197 
portfolio variance, 61 
positive skew, 41 

power of a test, 86 

predicted values, 121 
preferred stock, 21 

premium instrument, 19 
pretax nominal return, 11 
price currency, 216 

price relative, 11 

probability sampling, 75 
probability tree, 53 
properties of covariance, 60 
pure discount instrument, 19 
p-value, 87 


Q 


quantile, 35 
quantitative easing, 190 
quartile, 35 


quintile, 35 
quota, 207 
quota rents, 208 


R 


random sampling, 75 

range, 36 

real exchange rate, 217 

real money accounts, 216 
real return, 11 

real risk-free rate, 1 
recession, 161 

recognition lag, 177 
recovery, 162 

regionalism, 197 

regional trading agreement (RTA), 210 
regression line, 111 

relative dispersion, 38 
replication, 25 

required rate of return, 1, 21 
resampling, 71 

residual, 110 

retail FX market, 216 
revenue tools, 173 
Ricardian equivalence, 176 
roles of central banks, 183 
Roy’s safety-first criterion, 63 


S 


sample covariance, 45, 60 
sample kurtosis, 44 
sample mean, 33 

sample skewness, 42 
sample standard deviation, 38 
sample variance, 37 
sanctions, 200 

scatter plots, 44 

scenario analysis, 200 
sell side, 216 

services, 164 

shoe leather costs, 184 
shortfall risk, 63 


short run, 137 

short-run shutdown point, 138 
significance level, 86 

signposts, 200 

simple linear regression, 109 

simple random sampling, 75, 76 
skew, skewness, 41 

slope coefficient, 111 

soft power, 196 

source of variation, 119 

sovereign wealth funds, 216 
Spearman rank correlation test, 104 
spending tools, 173 

spot exchange rate, 218 

spurious correlation, 47 

Stackelberg model, 148 

standard deviation, 38 

standard error of the sample mean, 78 
standardization, 196 

state actors, 195 

stratified random sampling, 76 
structural budget deficit, 178 

sum of squared errors (SSE), 111, 118 
sum of squares regression (SSR), 117, 118 
supervised learning, 131 

systematic sampling, 76 


T 


target downside deviation, 39 
target independence, 187 
target semideviation, 39 
target zone, 220 

tariff, 207 

test statistic, 85 

text analytics, 131 

thematic risk, 199 

threshold level, 63 

time preference, 1 
time-weighted rate of return, 7 
tools of geopolitics, 200 

total sum of squares (SST), 117 
trading blocs, 210 

transfer payments, 173 


trimmed mean, 5, 35 

trimodal, 34 

trough, 161 

two-stage cluster sampling, 76 
Type I error, 86 

Type II error, 86 


U 


underfitting, 131 
unimodal, 34 
unsupervised learning, 131 


Vv 


variance (from a probability model), 52 

velocity of geopolitical risk, 199 

volatility, 52 

voluntary export restraint, 207 

voluntary export restraint (VER), 208 
W 


winsorized mean, 5, 35 
World Bank, 198 
World Trade Organization (WTO), 198 


Y 


yield to maturity, 19 


Z 


zero-coupon bond, 19 


